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Abstract 

In the fc-junta testing problem, a tester has to efficiently decide whether a given function 
/ : {0,1}" —> {0,1} is a fc-junta (i.e., depends on at most fc of its input bits) or is e-far from any fc- 
junta. Our main result is a quantum algorithm for this problem with query complexity O(^fkjs) 
and time complexity 0(n^/k/s). This quadratically improves over the query complexity of the 
previous best quantum junta tester, due to Atici and Servedio. Our tester is based on a new 
quantum algorithm for a gapped version of the combinatorial group testing problem, with an 
up to quartic improvement over the query complexity of the best classical algorithm. For our 
upper bound on the time complexity we give a near-linear time implementation of a shallow 
variant of the quantum Fourier transform over the symmetric group, similar to the Schur-Weyl 
transform. We also prove a lower bound of B(fc 1//3 ) queries for junta-testing (for constant s). 
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1 Introduction 


1.1 Quantum property testing 

Many computational problems are too hard to solve perfectly in any reasonable amount of time 
(especially if P ^ NP, as seems likely). Accordingly, much of theoretical as well as practical 
computer science is about trying to efficiently solve those problems in a weaker sense. Examples 
are trying to approximate the optimal solution, trying to solve the problem fast on average , trying 
to solve it fast on most instances , etc. A structured model for the latter is property testing. Here 
our goal is to test whether a given (usually very large) object / has a certain property V. Typically 
the hardest instances of the problem are the ones that are on the boundary, “just outside” of the 
property, where one needs to look at a large part of / to decide if it is in or out of the property. But, 
in many cases such instances tend to appear due to noise or other imperfections, and as such should 
not really be rejected. The setting of property testing excludes such instances: it assumes that the 
given instance / either has the property V, or is at least somewhat “far” from V (i.e., far from all 
instances that have property V. according to some suitable distance measure). This “promise” on 
the inputs makes many hard problems much easier, and many property testers have been found over 
the last two decades to efficiently test properties of very large objects, see for instance f33|. Note 
that a tester even allows us to conclude something about inputs that are outside of the promise: if 
a tester accepts input / with high probability, then / must be close to at least one element that 
has the property V. 

In this paper we focus on quantum algorithms for property testing. These are substantially 
less studied than classical algorithms, but quantum property testing has been receiving increasing 
attention in the last few years, both for testing properties of classical objects and for testing 
properties of quantum objects. See [H] for a recent survey. 

1.2 Group testing 

We first develop a new quantum algorithm for a version of the (combinatorial) group testing prob¬ 
lem^] Group testing was invented in World War II to efficiently identify ill soldiers [30]. Suppose 
n soldiers have each given samples of their blood, and up to k of them are ill. One way to identify 
the ill ones is to separately test each of the n blood samples. However, blood tests are expensive, 
and if k <C n then something much more efficient can be done. By combining parts of the blood 
samples of a subset S of all soldiers and testing the combined sample, we can determine whether at 
least one of the soldiers in S has the disease, at the expense of only one blood test. Using binary 
search we can then identify one ill soldier using O(logn) tests, and all k ill soldiers using O(fclogn) 
testsJ3 

Here we consider a “gapped” decision version of the group testing problem, which in its simplest 
form is the following: 

Gapped group testing (GGT). For some set A C [n], define /a: 2^ —* {0,1} by 

setting /a(* 5) = 1 iff S intersects A. Given the ability to query an /a where either 

1 To avoid confusion: while we use the established term “group testing” here, this is not a property testing problem 
in the sense described above, because we have no “in the property or far from the property” promise here. 

a O(fclogn) tests is essentially optimal for simple information-theoretic reasons: each test gives at most one bit of 
information, but by identifying the k ill soldiers we learn log (£) = f i(k log (n/k)) bits of information. There is a large 
literature that optimizes the constant factor and other aspects of group testing algorithms, see for instance |3U . 
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|A| < k or | A\ > k + d, decide which is the case. 


Note that the function /a is like a blood test, where A is the set of all ill soldiers and the input S is 
the set of the soldiers whose blood we include in the tested sample. The function outputs 1 exactly 
when at least one of the soldiers in S is ill. 

Recently, Belovs m showed that if |A| < k, then one can identify A with 0(\/k) quantum 
queries to f a■ Clearly this algorithm can be used to solve the GGT pro blem for d = 1. The 
randomized query complexity of this problem for d = 1 is &(k) (see Section 3.1 for references and 
proofs about the randomized case), so we have a quadratic quantum improvement over classical. 

Things get more interesting as d grows. In Section [3] we show a tight bound on the quantum 
query complexity of GGT of Q(^/k/d) for all d < k , while the randomized complexity of this 
problem is Q(k) for d < \fk and Q{(k/d ) 2 ) for d > Vk. We subsequently use this quantum 
algorithm as a subroutine for our junta testing algorithm, but we feel the GGT problem is quite 
interesting by itself as well, and may find applications elsewhere. We now mention several other 
reasons why our GGT algorithm is interesting. 


Fourth-power improvement. By considering our bounds on the complexity of the GGT prob¬ 
lem, we see that there is a quartic (fourth-power) quantum improvement in query complexity for 
the regime \fk < d < k. Most speed-ups obtained by quantum algorithms are either exponential 
(mostly, for computational problems from algebra or number theory) or quadratic or less (for al¬ 
gorithms based on Grover’s algorithm or its generalizations). In contrast, our algorithm provides 
a fourth-power speedup, which is quite surprising given that it is based on the OR function, for 
which the best-possible quantum speedup is only quadratic. 

Only a few other examples of such speed-ups are known: a cubic speed-up for exponential 
congruences by van Dam and Shparlinski [28], a quartic speed-up for finding counterfeit coins by 
Iwama et al. m , a cubic improvement for monotonicity testing on the hypercube by Belovs and 
Blais [12] (though that work was done after the work presented here), and a quartic improvement 
for learning an “exact-half junta” by Belovs m- Among these problems, a matching lower bound 
is known only for the last one. 

Very recently, several new separations were found for total Boolean functions: a quartic speed-up 
of bounded-error quantum algorithms over deterministic classical algorithms [3f, and a 2.5th-power 
speed-up of bounded-error quantum algorithms over bounded-error classical algorithms US- 

Direct use of adversary bound. We construct the algorithm from a feasible solution to the 
(dual) semidefinite program for the adversary bound. Reichardt et al. [[36] 07J 42J have shown that 
the adversary bound characterizes quantum query complexity up to a constant factor. This means 
that, in principle, any quantum query algorithm can be derived as a solution to this semidefinite 
program. However, the number of new quantum algorithms that have actually been obtained in this 
way (without any intermediate framework between the semidefinite program and the algorithm) 
remains fairly small. Moreover, the majority of these algorithms are developed in the learning graph 
methodology [8], while our algorithm is based on different ideas, borrowing from M- 

Robustness. Our algorithm still works if the action of the input function / a(S ) is not defined for 
some values of S. For instance, if |A| = k and S intersects A, the value of /a(S) can be anything: 
0, 1, or even undefined. The same is true if |A| = k + d and S 0 A = 0. We say that such S 
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are irrelevant variables. This property turns out to be very useful in applications of the GGT 
algorithm, in particular when we use it as a subroutine in our junta tester. 

As far as we know, such a property of a quantum algorithm was not previously studied or even 
explicitly defined. However, it turns out that this property is automatically satisfied for a quantum 
algorithm derived from the adversary bound if the latter satisfies some additional conditions. We 
obtain a similar behavior for the tight composition property of the adversary bound in the presence 
of irrelevant variables. 

Time-efficient implementation. Our algorithm is one of the few quantum algorithms derived 
from the adversary bound with a time-efficient implementation, i.e., one that is efficient in total 
number of gates as well as in total number of queries (in general, the time complexity of the 
adversary-derived algorithm can be exponentially large in the number of input bits to the problem). 
Other examples are the formula-evaluation algorithm of Reichardt and Spalek |49j and the algorithm 
for st-connectivity of Belovs and Reichardt [T3] . 

The time complexity of our algorithm is 0(ny/k/d), roughly n times its query complexity. This 
is probably the best one can hope for: the oracle takes an n-qubit input register, so it takes f2(n) 
gates just to touch all those qubits. Thus, any algorithm trying to beat our running time can only 
afford to change a small fraction of the string given to the input oracle. Also, realistic oracles will 
typically take time f2(n) to answer the query. 

The key to our time-efficient algorithm is an efficient, 0(n)-time, implementation of the quantum 
Fourier transform (QFT) on the linear space which we denote by M n . It is of dimension 2 n and has 
an orthonormal basis indexed by the set of all subsets of [n]. The symmetric group § n acts naturally 
on this space by permuting its basis elements, hence M n can be considered as an § n -module (a 
representation of § n ). 

Most of the previous work in this direction focused on the regular representation of S n , called 
the QFT over the symmetric group. The most efficient implementation of this kind is due to 
Kawano and Sekigawa [40] and can be implemented in depth 0(n 3 ), improving over [iT], 05, 39]. 
Our implementation, on the other hand, is close to the efficient quantum Schur-Weyl transform of 
Bacon, Chuang and Harrow [Si, though their algorithm is defined for another group, namely a 
product of a general linear group and a symmetric group. To the best of our knowledge, this is 
the first “algorithmic” application of this transformation m lists a number of applications of this 
transformation for quantum protocols). In order to ensure that the transform has all properties we 
need and to make our paper self-contained, we describe our construction in full detail instead of 
just referring to (slight modifications of) the construction of j5|l6]. 

1.3 Junta testing 

Our main result is about junta testing. Let / : {0, l} n — > {0,1} be a Boolean function, and J C [n] 
be the set of (indices of) variables on which the function depends. We say that / is a k-junta if 
| J| < k. Such functions are often studied, for instance in learning theory if most of the features are 
irrelevant for the concept that needs to be learned (e.g., in biology if only a few genes determine 
some biological property). We say that / is e-far from any k-junta if the normalized Hamming 
distance between / and g is at least e for every fc-junta g (i.e., / and g differ on at least e2 n inputs). 
The k-junta testing problem is: 
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/c-junta testing. Given the ability to query an /: {0,l} n — > {0,1} that is either a 
fc-junta or e-far from any fe-junta, decide which is the case. 

We would like to test this efficiently. The primary measure of efficiency is the number of “queries,” 
evaluations of /, which are usually the most expensive part of an algorithm. However, we will also 
consider time complexity later. 

Junta testing has been well-studied in the last decade, see m for a recent survey. Classically, 
the best known tester is by Blais m and uses 0(k log k + k/s) queries to /, quadratically improving 
upon an earlier tester of [32]. The best known classical lower bound is £l{k/e) [26] j^] 

The best quantum tester, due to Atici and Servedio [3], uses 0(k/e ) queries. It is based on 
Fourier sampling. This quantum tester is better than Blais’s classical tester by a log fc-factor (for 
constant e), but does not beat the best known classical lower bound, leaving open the possibility 
of an equally efficient classical tester. 

Our main result in this paper is a quantum tester with query complexity 0(y/k/e), which (up 
to logarithmic factors) quadratically improves over the previous best quantum junta tester and 
actually beats the known classical lower bound for the first time. We also give a time-efficient 
implementation. 

Main theorem (informal). There is a quantum fc-junta tester that uses ()(\/k/e) queries and 
0(ny/k/e) time (i.e., elementary quantum gates and query gates). 

Similarly to the GGT problem, this time complexity is the best that one could reasonably 
expect given our query complexity, because each query to / involves an n-qubit input register. 

Our junta tester is described in Section]!} The idea is the following (suppressing the dependence 
on e for simplicity). If / is far from any fc-junta, then it depends on some K > k variables, and 
together those K — k “extra” variables will have at least e “influence” (this will be quantified using 
the Fourier coefficients of /). We divide those extra variables in sets depending on their influence, 
and show that one of the following cases holds: 

1. For some integer j E {0,... ,0(logfc)} there are d > k/ extra variables, each of influence 
at least 2 3 /k. We thus need to distinguish the case where there are at most k influential 
variables from the case where there are at least k + d variables, each of influence roughly 
2Our quantum group tester, combined with a procedure to detect variables of influence 
roughly 2 i/k, can distinguish these two cases using 0(y/k) queries]^] 

2. There are many (S> k) variables of very low influence (< 1 /k). In this case a random subset V 
of 1/k of all variables has influence 0(1/A:) with probability close to 1. In contrast, if / is a 
fc-junta then with significant probability V will not contain any relevant variable, and hence 
have 0 influence. We can distinguish those two cases using O(Vk) queries. 

We then put together the testers for these special cases in order to get an 0{\fkje)- query tester 
that covers all cases. 

Tor the special case of non-adaptive junta testers (i.e., ones that choose all queries in advance, so the next query 
will not depend on outcomes of earlier queries), Servedio et al. [ST] very recently showed a slightly stronger lower 
bound, which is bigger than the upper bound of Blais’s tester for appropriate values of e. This shows that adaptivity 
helps (slightly) for classical junta testers. 

4 In the setting of classical testers, Garcia-Soriano [331 p. Ill] also noted “a striking resemblance between group 
testing and junta testing.” 
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Let us now briefly mention the lower bounds we obtain. As already noted by Atici and Serve- 
dio [3] and explained in Section [bj the classical lower bound approach for junta testing fails for quan¬ 
tum algorithms, because the corresponding instances can be easily solved quantumly in O(logfc) 
queries. Instead of this, in Section [6] we describe a different approach using reduction from the 
problem of testing image size of a function. This already gives a lower bound of H(fc 1//3 ) by the 
Aaronson-Shi lower bound for the collision problem [T]. We believe that the actual complexity of 
testing support size of a distribution is around £l(Vk), but proving this seems to require techniques 
beyond the state of the art in quantum lower bounds. 


1.4 Some remarks on organization 


The main results of the paper—quantum algorithms for the GGT problem and junta testing—are 
given in Sections [3] [5| For both problems, we give two versions of the algorithm: first, a query- 
efficient algorithm, and then its time-efficient implementation. Our organization of the paper is 
such that a reader only interested in the query-efficient algorithms can read Sections [3] and [4j 
skipping the more technical time-efficient implementations of Section [5j 


For the GGT problem, a quantum query-efficient algorithm is given in Section 3.2, and a time- 


efficient implementation of the same algorithm is given in Sections 5.1-5.3 


For the junta testing problem, we actuall y giv e two quantum algorithms: first, a query algorithm 


with complexity 0(\Jk/e log k) in Theorem 4.1, and then an algorithm with slightly worse query 


complexity 0{y/k/e) and time complexity 0{ny/kje ) in Theorem 


5.4 


2 Preliminaries 

We use [n] to denote the set {1,2,..., n}, and 2 A to denote the set of subsets of A. A /c-subset is 
a subset of size k. All matrices in this paper have real entries. If A is a matrix, A[z, j] denotes the 
element at row i and column j. A projector always stands for an orthogonal projector. We use II^ 
to denote the projector onto a subspace S. We use log and In to denote logarithms in base 2 and e, 
respectively. The notation stands for a finite held with q elements. 

We assume familiarity with basic probability theory. Let B(k,p) denote the binomial distribu¬ 
tion: Pr [B(k,p) = i] = (^)p*(l — p) k ~ l . We use 'H n (k,m) to denote the hypergeometric distribution, 
i.e., the distribution of [An [fc]| when A is sampled from all m-subsets of [n] uniformly at random. 
By X rsj £>, we denote that X is sampled from probability distribution B. 

2.1 Quantum algorithms 

Let us define quantum query algorithms. For a more complete treatment see [22]. A quantum 
query algorithm is defined as a sequence of unitary transformations alternating with oracle calls: 

U 0 -» O x -> U x -> O x -> • • • -> U T -i U t . (1) 

Here the UiS are arbitrary unitary transformations that are independent of the input. The input 
oracle O x is the same throughout the algorithm, and is the only way the algorithm accesses the 
input string x = (xj). The input oracle decomposes in the following way: 

° x = ® je[n] 
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where O x j is some unitary transformation that only depends on the symbol Xj. In this paper x 
will be a Boolean string, and we adopt the following convention: O x ,j = / if Xj = 0, and O x ,j = —I 
if Xj = 1, where I is the identity operator. 

The computation starts in a predefined state |0). After all the operations in are performed, 
some predefined output register is measured. We say that the algorithm computes a function F 
(with bounded error) if, for any x in the domain, the result of the measurement is F(x) with 
probability at least 2/3. The number T is the query complexity of the algorithm. The smallest 
value of T among all algorithms computing / is the quantum query complexity of F, and is denoted 
by Q{F). 

We will also be interested in time complexity (also known as gate complexity) of the algorithm. 
It is defined as the total number of elementary quantum gates (from some fixed universal set of 
gates) required to implement all the unitary transformations Uq, ..., Ut- 

One of our main algorithmic tools is amplitude amplification. This is encapsulated in the 
following result of Brassard et al. m Section 2], which generalizes Grover’s quantum search algo¬ 
rithm [35]. 

Lemma 2.1 (Amplitude amplification). Let A be some quantum procedure and S some set of 
basis states on the algorithm’s output space. Suppose that the probability that measuring the state 
A|0) gives a basis state in S is at least p. Then there exists another procedure B, which invokes 
A and A~ l 0(1/y/p) many times (we sometimes call such an invocation a “round of amplitude 
amplification”), such that the probability that measuring the state H|0) gives a basis state in S is 
at least 9/10. If, in contrast, the probability of obtaining a basis state in S when measuring A|0) 
was 0, this probability will still be 0 when measuring B |0). 

For time-efficient implementation of our algorithm, we need the following two results. 

Theorem 2.2 (Phase Estimation j?T, 27]). Assume a unitary U is given as a black box. There 
exists a quantum algorithm that, given an eigenvector \psi) of U with eigenvalue e , outputs a real 
number w such that \w — < 5 with probability at least 9/10. Moreover, the algorithm uses 0(1/5) 

controlled applications of U and t/ _1 and | polylog(l/<5) other elementary operations. 

Lemma 2.3 (Effective Spectral Gap Lemma [33])- Lei IIi and II 2 be two orthogonal projectors in 
the same vector space (not necessarily pairwise orthogonal), and R± = 2IIi — I and R 2 = 2 II 2 — I be 
the reflections about their images. For 5 > 0, let Ps be the projector on the span of all eigenvectors 
of R 2 R 1 that have eigenvalues e' e with |0| < 5. Then, for any vector w in the kernel of Hi, we have 

II-P 5 II 2 HI < ^IM|. 


2.2 Adversary bound 

Here we describe the dual adversary bound, the main tool for the construction of our algorithms. 

Let F: T> —>• {0,1}, with T> C {0, l} n , be a partial Boolean function. The (dual) adversary 
bound, ADV ± (I ? ), is defined as the optimal value of the following semi-definite optimization prob- 
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lem: 


minimize max > X~lz,zl 

z&V ^je[n ] 31 ’ J 

subject to Xjl x ,y} = 1 for all x,y E V with Fix) F(y); 

Xj y 0 for all j E [n], 


(3a) 

(3b) 

(3c) 


where Xj are T> x V positive semi-definite matrices. Recall that Q{F) denotes the bounded-error 
quantum query complexity of F. Then, we have the following important result. 

Theorem 2.4 ( [36l 1351 133] ). For every F, Q(F) = 0(ADV ± (F)). 


Because of Theorem 2.4, one may come up with a solution to the adversary bound instead of 


explicitly constructing a quantum algorithm. This is how we construct the algorithm in Section |3j 
The following “unweighted adversary bound” is a useful special case (and precursor) of the general 
adversary lower bound: 


Theorem 2.5 ([2j ). Suppose there is a non-empty relation R C F x (l) x F x (0) that satisfies 
(i) for each x E T’ _1 (l) appearing in R, there are at least m distinct y E F -1 (0) such that 

{x, y ) e R; 

(ii) for each y E .F _1 (0) appearing in R, there are at least m' distinct x E F _1 (l) such that 
0, y) E R; 

(Hi) for each x E F _1 (l) and each j E [n], there are at most l distinct y E F~ l { 0) such that 
(x, y) E R and xj y rj ; 

(iv) for each y E F _1 (0) and each j E [n], there are at most F distinct x E F _1 (l) such that 
(x, y) E R and xj y rj ; 


Then, the bounded-error quantum query complexity of F is Q 



The adversary bound is also useful for function composition. Assume F:T> —>• {0,1}, with 
V C {0, l} n , and, for any j E [n], let Gj be a partial Boolean function on mj variables. The 
composed Boolean function F o (G i,..., G n ) on m j variables is defined by 

(xn, ■ ■ ■ j x \ mi , • • •, x n i j • • • j 3Cnm n ) 1 ^ F(Gi(xn, • • • ? • • • 5 G n (x n i j • • • i %nrrin )). (4) 


where the composed function is defined on the input (in,..., x nmn ) iff the values of all Gj on the 
right-hand side of Q are defined, and the corresponding n-tuple belongs to V. 

Theorem 2.6 ([37]). We have 


ADV ± (F o (G 1} ..., G n )) < ADV ± (F) max ADV ± (G i ). 

ie[n] 

In particular, this theorem together with Theorem |2.4| implies 
Corollary 2.7 (Tight composition result). Q(F o (G \,..., G n )) = 0(Q(F) max Q(Gj)). 




That is, one can compose functions without the logarithmic overhead in query complexity 
that arises in the standard method of composition (which would reduce the error probability of 
the algorithm for the internal functions to <C 1/n by taking the majority-outcome of O(logn) 


independent runs of the algorithm). Notice, though, that Corollary 2.7 talks only about the query 
complexity of the resulting function, not its time complexity. Accordingly, even if the functions F 
and Gj can be evaluated time-efficiently, this does not imply that the algorithm for the composed 


function from Corollary |2.7| can be implemented time-efficiently. In order to get a time-efficient 
implementation, it is usually better to compose the algorithms for F and Gj using the standard 
method. 


2.3 Irrelevant Variables 

Consider the following motivating example. Let G i,., 
the “robust conjunction” of the functions Gi, 


., G n be partial Boolean functions. We define 


H(x) = /\ Gi(x) 

' uE|n] 


(5) 


as the partial Boolean function H given by the following: if G'i(x) = • • ■ = G n (x) = 1 (in particular, 
x is in the domain of all Gi), then H(x) = 1; if there exists an i for which Gj.(x) = 0 (in particular, 
x is in the domain of Gi), then H(x) = 0; and otherwise H(x) is not defined. Note that in the 
second case x may lie outside the domain of Gj for some (or even all) j ^ i. 

One interpretation of this expression, which we use in Section [4j is as follows. The function H 
is some test for x, and the Gi are sub-tests, which check for different possibilities of how H can 
fail. Thus, a positive input must satisfy all the sub-tests, whereas a negative x has to fail at least 
one sub-test G, but might give an ambiguous answer on other tests Gj. 

Classically, the above is a non-issue, since we can always apply the algorithm for Gi on an 
input x, even if that input is outside the domain of Gi —the algorithm’s output must still be either 
0 or 1. Quantumly, the situation is more delicate: strictly speaking, we cannot apply the textbook 
Grover search to evaluate H since the oracle in the definition of a quantum query algorithm is 
supposed to apply either I or —I on each input, but a quantum algorithm for Gi on an input x 
outside its domain may apply an arbitrary unitary transformation on its entire working space. 

In this particular case, H can be evaluated using amplitude amplification, Lemma |2.1[ instead 
of the usual Grover search. In Proposition 2.11| below, we extend this result to the case when the 
conjunction is replaced by an arbitrary partial Boolean function F. Additionally, we show how to 


generalize the tight composition result, Corollary 2.7, to this more general setting. In order to do 
this, we have to make a number of definitions. 

Definition 2.8 (Irrelevant variables). Let F : V —> {0,1} be a partial Boolean function with the 
domain V C {0, l} n . For each input x 6 T>, some input variables j 6 [n] may be called irrelevant, 
the remaining variables called relevant. This can be done in an arbitrary way, as long as the 
following consistency condition is satisfied: for any x, y E T> such that F(x) / F(y), there must 
exist a variable j relevant to both x and y and such that Xj ^ yj. 

Definition 2.9 (Evaluation with irrelevant variables). Evaluation of the function F with irrelevant 


variables is defined as in Section 2.1, with the difference that, for an input z 6 D, the input oracle 
may malfunction on irrelevant variables, i.e., O z j in ([2]) may be an arbitrary unitary if j is irrelevant 
for z. 
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Definition 2.10 (Composition with irrelevant variables). The composition F o (G ±,..., G n ) with 
irrelevant variables is defined as in Q but on a larger domain. Namely, the right-hand side of Q is 
defined iff there exists such that Zj = Gj (xj i ,... ,Xj mj ) for all relevant j. In particular, the 

value of Gj(xji, ..., Xj mj ) need not be defined for irrelevant j. The value of the composed function 
on this input is then set to F(z), and does not depend on the particular choice of 2 . 


We use the last two definitions as follows. 
efficient algorithm for testing juntas, using Corollary 2.12 below. 


Definition |2.10| is used in Section [4] to get a query- 
Thus we save a logarithmic 


factor, as described after Corollary |2.7[ Definition |2.9| is used in Section [5] to get a time-efficient 
implementation of the algorithm from Section [4] (again, see the discussion after Corollary 2.7). 

The following proposition is a special case of the construction in HU. 


Proposition 2.11. Let ( Xj ) be a feasible solution to the adversary bound 0 with objective value T. 
Call an input variable j is irrelevant for an input z E T> iff Xj\z,z\ = 0. With this choice of 
irrelevant variables, 


(a) There exists a quantum algorithm that evaluates the function F in the sense of Definition \2.£\ 
using 0(T) queries. 

(b) For arbitrary partial Boolean functions Gj, we have 


ADV ± (Fo (G lt ...,G n )) <T maxADV ± (Gj), 

ie[n] 


where F o (Gi,..., G n ) is as in Definition 2.10. 


It is easy to see that this choice of irrelevant variables satisfies the consistency condition of 
Indeed, if F{x) / F(y), then (3b) implies the existence of j with Xj\x,y\ / 0, and 


Definition 2.8 


since Xj 0, both Xj\x, x] and Xj \y,y\ are non-zero. 

The proof of point (a) is analogous to the proof of the upper bound of Theorem 2.4 and we will 
skip the details here. However, we will prove in Section 5.1 that our time-efficient implementation 
of the corresponding solution to the adversary bound has this property. 

We give a short proof of point (b) in Appendix [Aj Point (b) immediately gives the following 
variant of Corollary |2.7[ 


Corollary 2.12. Let ( Xj ) be a feasible solution to the adversary bound 0 with objective value T. 
We say that an input variable j is irrelevant for an input z E D iff Xj\z,z\ = 0. For arbitrary 
partial Boolean functions Gj, we have 


Q(Fo(G 1 ,...,G n )) =0(T ma xQ(Gj)), 

J'e[n] 


with the composition as in Definition 2.10 with this choice of irrelevant variables. 


Example 2.13 (AND). Let us return to the example in ([ 5 ]). Consider the AND function on the 
domain P = {z 6 {0, l} n | \z\ > n — 1}, where \z\ is the Hamming weight. A feasible solution to Q 
for this function is given by Xj = ifjifj, where ifj E is given by 




n -i/ 4 , if 1^1 _ n . 

< n 1 / 4 , if \z\ = n — 1 and Zj = 0; 
^0, if \z\ = n — 1 and Zj = 1 . 
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The objective value of this solution is y/n. Note that if \z\ = n — 1, then any variable j with Zj = 1 
is irrelevant for this input. This coincides with our definition of the “robust conjunction” at the 
beginning of this section. 

2.4 Fourier analysis 

We use Fourier analysis for arbitrary real-valued functions /: {0, l} n —> M. If / is Boolean, it is 
usually convenient to assume that its range is {±1} = {1,-1} rather than {0,1}. For a string 
s E {0, l} n , the corresponding character is a Boolean function y s : {0, l} n —> {±1} defined by 
Xs(x) = (—l) s ' x , where s ■ x = Yj SjXj denotes the inner product of s and x. We will often use the 
corresponding subset S C [n] instead of a string s E {0, l} n . 

Every function /: {0, l} n —> M, has a Fourier decomposition as follows: 

/0) = X! f( s )Xs(x), 
se{ o,i} n 

where f(s) = 2 ~ n Y x f( x )Xs(x) is the Fourier coefficient. The set {s | f(s) / 0} is called the 
(Fourier) spectrum of f. Parseval’s identity says that rhis transformation respects the norm: 
Ex [/(x) 2 ] = Y s f(s) 2 . In particular, for a Boolean /: { 0 , l} n —* {± 1 }, we have Y s /( s ) 2 = 1- 
For a subset S C [n], we define the influence of S on / by 

Inf s(f) = E /( T ) 2 - (6) 

T:Tn5^0 

If S consists of a single element j E [n] we write Infj(/), which is YIt- jeT /(^) 2 - 

An alternative (but equivalent) definition of influence for functions with range {±1} is as follows. 
Consider the following randomized procedure. Generate x E {0, l} n uniformly at random. Obtain 
y E {0, l} n from x by replacing, for each j E S, Xj by an independent uniformly random bit. Then, 
the influence Infg(/) is precisely twice the probability that f(x) /(y). Note that the influence 
Inf j(f) equals the probability that f(x) 7 ^ /(x® J ) when x is sampled from {0, l} n uniformly at 
random, where x®° denotes x with the }th bit flipped. We repeatedly use the following two obvious 
properties of influence: 

• Monotonicity. If S C T. then Inf$(/) < Infr(/). 

• Subadditivity. Infsuy(/) < Inf 5 (/) + Infr(/) for all S,T C [n]. 

The following lemma from [3j (implicit in the proof of their Theorem III.3) explains why influence 
is important in our junta testing algorithm. 

Lemma 2.14. If f is e-far from any k-junta, then for all W C [n] of size \W\ < k we have 

Inf [n]\w(/) > £■ 

Proof. Define a (not necessarily Boolean) function g: {0, l} n —> M by g(x) = Yscw f^)xs(x). 
Let h. be the Boolean function that is the sign of g. This h only depends on the variables in IF, so 
it is a fc-junta. Since / is e-far from any fc-junta, we have (using Parseval’s identity) 

* < < E[(/w _ 9(i)) 2] = - g(S)f = E f(Sf. D 

x s S£W 
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3 Gap version of group testing 


Our junta testers, which we describe in the next section, work by generating a random subset 
S C [n] and testing whether it intersects the set of influential variables of /. In this section 
we study a more abstract problem, where we assume that we have an oracle that answers this 
intersection-question with certainty. We believe this problem is of independent interest. 

For each A C [n], define the function Intersects^: 2^ —> {0,1} by 

Intersects^(S) = f lf ^ n 5 ^ (7) 

I 0, otherwise. 

In the standard version of group testing m one is given oracle access to the function Intersects^ 
for some A with |A| < k, and the task is to identify A. A natural variant of this problem is 
to compute or approximate the cardinality of A (a task already considered in the group testing 
literature [29]), and a decision version of the latter is deciding whether that cardinality is k or k + d 
for some d,k > 1. We define this formally next. 

Definition 3.1 (EGGT). Let k and d be positive integers, X consist of all subsets of [n] having 
size exactly k. and y consist of all subsets of [ n] having size exactly k + d. In the exact gap version 
of the group testing (EGGT) problem with parameters k and d , one is given oracle access to the 
function Intersects^ with A £ X U y, and the task is to decide whether A E X or A E y. 

We also study a relaxation of EGGT, in which we allow “false negatives” in the small-set case 
and “false positives” in the large-set case. This will be convenient for our applications. Luckily, 
algorithms for solving EGGT often turn out to also solve this harder problem. 

Definition 3.2 (GGT). Let k and d be positive integers. Define two families of functions 

X = {/: 2^ -> {0,1} | 3,4 e * VSC [n] : SH A = 0 =► f(S) = o} (8) 

and 

y={f: 2 [n] ->• {0,1} | 3B e y VS C [n] : 5 n B + 0 =► f(S) = l} . (9) 

In the gap version of the group testing (GGT) problem with parameters k and d, one is given oracle 
access to / G X U T, and the task is to decide whether / E X or / E T- 

It is easy to see that EGGT is a special case of the GGT, where the implications in ([8]) and © 
are replaced by equivalences. The GGT problem also includes as a special case the problem of 
distinguishing a function Intersects^(/) with |^4| < k from a function Intersects^(/) with |^4| > 
k + d. 

3.1 Randomized complexity 

In this section we show that the randomized query complexity of the gap version of group testing 
is 0(min{fc, 1 + ( k/d ) 2 }). 

The upper bound already follows from the existing literature on group testing: an 0(l + (k/d ) 2 ) 
bound appears in [25] (with a surprisingly elaborate analysis) and an O^klogk) bound appears 
in [H] and independently in [33l Section 5.3]. Strictly speaking those results only apply to the 
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EGGT problem. Because of that, and also for completeness, we include in Section [3. 1. 1| an upper 
bound for the more general GGT problem. 

The only lower bound for randomized complexity that we are aware of is 0(&) for the special case 
d = 1 due to Garcfa-Soriano [33], Section 5.3], who calls the problem “relaxed group testing.” We 


give two different lower bound proofs here for the general case of arbitrary d. The first (Section 3.1.2) 
is a short reduction from the Gap Hamming Distance problem in communication complexity which 
applies only to GGT. The second (Section 3.1.3) is a substantially longer but self-contained proof 
which applies to EGGT as well. 


3.1.1 Upper bound 

Theorem 3.3. For any k,d > 1, the randomized query complexity of the GGT problem with 
parameters k and d is 0(min{A; log k, 1 + ( k/d ) 2 }). 

Proof. We start with the easier upper bound of 0(1 + ( k/d ) 2 ). Take S C [n] by including each 
element independently at random with probability 1/k. If we are in the “small” case of GGT (as 
in ([8])), the probability of f(S) = 1 is at most 1 — (1 — 1 /k) k . If, on the other hand, we are in the 
“large” case of GGT (as in Q) then that probability is at least 1 — (1 — 1 /k) k+d . As these two 
probabilities differ by D(min{l, d/k}), by a Chernoff bound, we can distinguish the two cases by 
repeating this procedure 0(1 + (k/d) 2 ) times. 

We now prove the upper bound 0(klogk). The algorithm maintains a partition of [n], initially 
set to the trivial partition {[n]}. Each set in the partition can be either active or inactive, with the 
initial set [n] being active. We maintain the invariant that for all sets S in the partition, f(S) = 1. 
(We can assume f([n\) = 1 as otherwise we are clearly in the “small” case.) At each step of the 
algorithm, we take an active set S in the partition and repeat the following 10 log A; times. We 
partition S into Si and S 2 by taking each element of S independently to be in either Si or S 2 with 
probability 1/2. We then query / on both Si and If / returns 1 on both, then we replace S 
with Si and S 2 , and this ends the loop for S. Otherwise, if we did not manage to split S after 
10 log A: attempts, we declare S to be inactive and move on to another set in the partition. If at 
any point the partition contains at least k + 1 sets we stop and output “large.” Otherwise, if all 
(at most k) sets are inactive, we stop and output “small.” 

Notice that after each step we either add a set to the partition or declare a set inactive. There 
can therefore be at most 2k steps, and since each step involves at most 10 log A; queries, the total 
number of queries is at most 20fclog/c. The correctness in the “small” case is immediate from our 
invariant: the only way for the algorithm to output “large” is if there are k+ 1 sets in the partition 
on which / returns 1 but this cannot happen in the small case. So consider the “large” case as 
in © with some set B of size at least k + 1. We claim that with high probability, all inactive 
sets intersect B in at most 1 element, and hence it cannot happen that there are at most k sets in 
the partition and all are inactive. To see why, notice that if a set S intersects B in at least two 
elements, then there is probability 1/2 that when we split S into Si and S 2 , these two elements 
would end up in a different set. In this case / must answer 1 on both S\ and S 2 and S would be 
split. Therefore the probability for such an S to become inactive is at most 2~ 101ogfc = k~ 10 , which 
means that with high probability this bad event will never happen. □ 
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3.1.2 Lower bound for GGT via communication complexity 

Theorem 3.4. For any k,d > 1, the randomized query complexity of the GGT problem with 
parameters k and d is fl(min{A, 1 + ( k/d ) 2 }). 

Proof. Following the general approach to classical testing lower bounds of Blais et al. |18| , we will 
show a reduction from the Gap Hamming Distance (GHD) problem in communication complexity. 
In this problem there are two parties, Alice and Bob. Alice receives a bit string x E {0,1 } n and 
Bob receives a bit string y E {0, l} n . Their goal is to decide if the Hamming weight of x (By (i.e., 
the Hamming distance between x and y) is greater than n/2 + g or at most n/2 — g (and can behave 
arbitrarily for values in between). They are allowed the use of shared randomness and, for each 
input (x,y), need to output the correct answer with probability, say, at least 2/3. It was shown 
in [23] (see also [52] ) that for any 1 < g < n/2, any protocol solving this problem must use at least 
H(min{n, n 2 /g 2 }) bits of communication (and this is tight). 

Now let k,d > 1. It clearly suffices to prove the theorem for d < k, so assume that this 
is the case. The result now follows from the observation that any algorithm solving GGT with 
parameters k and d making q < k queries implies a protocol for GHD with parameters n = 2k and 
g = d using O(qlogk) bits of communication. In this protocol, Alice and Bob simulate the GGT 
algorithm given oracle access to Intersects^ where A C [n] is the support of x@y, using their shared 
randomness as coins to the algorithm. Whenever the algorithm performs a query S, Alice and Bob 
compute Intersects^ (S) by running an Equality protocol with error probability less than 1/(10 k) 
and communication O(logA) (see @21 Example 3.13]) to check if the restrictions of x and y to S 
are identical. It remains to notice that if the Hamming weight of x © y is at most n/2 — g = k — d, 
then Intersects^ E A, whereas if it is greater than n/2 + g = k + d, then Intersects^ E A- □ 

3.1.3 Direct lower bound for EGGT 

Theorem 3.5. For any k,d > 1. the randomized query complexity of the EGGT problem with 
parameters k and d is fl(min{A, 1 + (k/d) 2 }). 

Proof. We apply Yao’s principle. Let A and y be the uniform probability distributions over all 
subsets of [n] of sizes k and A + d, respectively, and T> be the uniform mixture of X and y. Denote 
m = min { [n/4J , \ n(k + d)/d ?j }. Assume that T is a deterministic decision tree of depth less than 
m/( 4? In k). We prove that, if n is large enough, the total variation distance between the outputs 
of T on A and A is at most 1 — H(l). As the success probability of a randomized algorithm with 
error probability <1/3 can be amplified to be larger than that same 1 — D(l) in a constant number 
of repetitions, the theorem follows. 

The main idea of the proof is this. We first show in Lemma |3.6| that queries to subsets of 
cardinality larger than 4^ In A; are going to return 1 with very high probability. As a result, such 
queries are useless and we can avoid them altogether, resulting in a decision tree T' in which all 
queries are to subsets of size at most 4? In A. We then complete the proof by showing that since 
T' looks at no more than m elements of the universe [n], it cannot distinguish A and y with high 
probability. 

Let us introduce some notation. Each vertex V of a deterministic decision tree T is characterized 
by its query Sy Q [n]. If V is at depth t — 1 , let Vi ,..., V* be the vertices on the path from the 
root to V, with Vi being the root and Vf = V. For i E [t — 1], let Si be the corresponding query 
and bi E {0,1} be the output of the oracle that leads from Vi to Vj+i. We say that the vertex 
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V is used on an input A C [n] if it is visited when applying the decision tree, i.e., if and only if 
Intersects^ (Si) = bi for all i G [t — 1]. Finally, the set of fresh variables queried by a vertex V 
consists of those variables not queried by its ancestors, namely, Fy = Sy \u ie[t- 1] Si- 

Lemma 3.6. For any decision tree T of depth T < k/2, there exists a decision tree T' of the same 
depth such that 

• for any vertex V of T ', the set of fresh variables, Fy, is of size at most In k; and 

• the probability that T and T' disagree on A ~ V is at most l/k. 

Proof. Consider a sequence of decision trees 

T 0 = T, 71,..., Tf-i, T t = T' 


defined in the following way. Assume that lt-i is already defined. For any vertex V of 7t-i at 
depth t — 1 such that |Fy| >4^ In k, replace it by a dummy query that always returns 1. It is clear 
that T' satisfies the first requirement. From Claim [3 Tt| below, it follows that the probability that 
Tt-i and Tt disagree on A ~ V is at most l/k 2 . As T < k/2 , it follows by a union bound that T' 
satisfies the second requirement as well. □ 


Claim 3.7. Assume that V is a vertex of %-i at depth t—1. If |Fy| > 4? In k, then Pr [An Sy = 
0 | V used on A] <l/k 2 . 


Proof. Let Si and bi be defined as above the statement of Lemma 3.6. Let P = {i \ bi = 1} be the 


locations where the path expects an answer 1 from the oracle. For each A on which V is used and 
each i G P, define hi(A) 6 [n] as the minimal element of A n Si. 

Assume that (cj)j g p is some possible list of the values of hi{A) (attained for at least one A). 
Notice that if A is chosen from X and we condition on V being used, and on hi(A ) = q for all 
iGP, then the distribution of A is uniform over all sets of the form C U B where 


C — {ci}i^p , 


and I? is a subset of [n] \ D of cardinality k — \C\ for 

D = {j € [n ] | 3i : (bi = 0 A j € Si) V (bi = 1 A j € Si A j < Cj)} . 
Therefore, under the same conditional distribution, we have 


rrp n 5 V = 0] < Pr[A n F V = 0] = (" -[ D / - j*l) / (" : £() 

4 In k \ fc//2 




n 


IT s 


< e 


—2 In k 


= l/k 2 


where we used \C\ < |P| < k/2. The case A E y is similar, and we get the same upper bound. As 
the conditional probability is at most l/k 2 in all possible cases, the probability in the statement of 
the lemma is also at most l/k 2 . □ 
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Therefore, in order to complete the proof of the theorem, it suffices to show that the total 
variation distance between the output distributions of T' on X and on y is 1 — 11(1). Notice that 
since the depth of T' is at most m/(4)( In k) and since each query is to a subset of cardinality at 
most 4^ In A;, the total number of elements of the universe [n] “addressed” by T' is at most m. It 
therefore suffices to prove the following stronger statement. For any deterministic decision tree B 
of depth at most m that makes queries of the form j 6? A, the total variation distance between the 
output distributions of B on X and on y is at most 1 — 14(1). 

To show this, we first claim that due to the strong symmetry properties of X and y, we can 
assume without loss of generality that for all j, all vertices of B at depth j — 1 perform a query 
on element j. Indeed, we can obviously assume that the queries along each path from the root 
to a leaf are distinct. Therefore, by applying a permutation on the universe [n], we can ensure 
that the query at the root is 1 and all other vertices perform queries in {2,... , n}. Crucially, this 
permutation leaves both X and y invariant. Similarly, we can now recur on each subtree of the 
root, and perform a permutation so as to make both vertices at depth 1 query the element 2. Since 
X is invariant under permutations of {2,..., n}, both when conditioned on containing 1 and on not 
containing 1, and similarly for y, this modification again does not affect the output distribution. 
Continuing in the same fashion we obtain the claim. 

As a result, we obtain that B depends only on An{l,..., m}. Since the distribution of the latter 
when A is chosen from X and from y is symmetric, it is enough to show that the total variation 
distance between the distribution of |An{l,..., m}\ when A is chosen from X and that distribution 
when A is chosen from y, is 1 — 12(1). This is shown in the following technical claim, for which we 
observe that the former distribution is precisely the hypergeometric distribution 7i n (k,m ) and the 
latter distribution is 'H n (k + d,m). □ 

Claim 3.8. For all integers k,d > 1, there exists N such that, for all n> N, the total variation 
distance between 'H n (k,m) and T~L n {k + d,m) is 1 — S7(l), where m = min {|_n/4j , \n(k + d)/d 2 \ }. 

Proof. Let p = min {1/4, (A; + d)/d 2 }. As is well known, T~L n {k, |_ pn\ ) converges to the binomial 
distribution B(k,p) as n —» oo. Thus, it suffices to estimate the total variation distance between 
B(k,p) and B(k + d,p). 

If d > k, then Pr \B{d,p) = 0] > (1 — 2 /d) d > e~ 2 /2 for large enough d. Conditioned on 
the event that the last d out of k + d coin flips in B(k + d,p) are 0 the two distributions B(k,p) 
and B(k + d,p) become the same, hence the total variation distance between the (unconditioned) 
distributions B(k,p ) and B(k + d,p) is at most 1 — 12(1). Thus we may further assume that d < k. 

Next, 

Pr [B(k,p) = £\ d k(k - 1) • • • (fc - 1+ 1) 

Pr [B(k + d,p) =£] ~ [ P) {k + d)(k + d — 1) • • ■ (k + d — l + 1) 

is a non-increasing function in £, hence, the total variation distance between B(k,p ) and B(k + d,p) 
is equal to the Kolmogorov distance between the two: 


max 

e 


Pr [£>(&; + d, p) > l\ — Pr \B{k,p) > (\ 


( 10 ) 


Let l be the value for which the maximum in (10) in attained. There are two cases. First, if 
£ > \{k T d)p ], then 

Pr [B(k,p) > £\ < Pr \B(k + d,p) > £\ < ^ 


16 







using the expression for the median of the binomial distribution [38] . hence (10) is at most 1/2. 

The second case is where i < \(k + d)p]. Defining T = minjfc, kp + 3^/kp(l — p)} and noting 
that £ < £', we have that 

Pr [B(k + d,p) > £\ > Pr \j3{k,p) > £\ > Pr [B(k,p) > /] = 9(1) , 

where the last equality follows from Slud’s inequality f53j (which applies here due to our choice of 
H! and since p < 1/4). Again, it follows that (10) is at most 1 — 9(1). O 


3.2 Quantum complexity 

The aim of this section is to show that the quantum query complexity of the EGGTproblem is 
0(l + ^ Jk/d ). Thus, when yfk < d < k, the quantum algorithm provides a quartic improvement 
over the randomized one. We start with a lower bound for EGGT, which implies the same lower 
bound for GGT. 


Proposition 3.9. The quantum query complexity of the EGGT problem with parameters k and d 
is 9( 1 + y/k/d). 

Proof. Take n = k + d. In this case y contains only one element, namely [n], and the corresponding 
function takes value 1 on every S except 5 = 0. Intuitively, one detects that A e X by finding an 
i 0 A, so the problem becomes the unstructured search problem of size n, where the d elements of 
[n] \ A are marked. Unstructured search requires 9(yTi/d) queries [ T9] . 

This intuitive argument can be made rigorous via the unweighted adversary lower bound (The¬ 
orem 2.5). We put the one input in y in relation with all (/) inputs in T, so m = 1 and m' = (/). 


For each fixed nonempty 5 C [n], there are different A 6 (^) such that An5 = 0 (these are 

the A e X where a query to 5 returns value 0, showing that A [n]). This number is maximized 
for |5| = 1, so £' < ("/T 1 ) (and £ = 1). The lower bound from Theorem 


2.5 


is 


n 


i mm' 

~w 


= n 



= fl[y/n/d) = 9(1 + y/k/d). 


□ 


The remaining part of this section is devoted to showing a matching upper bound on the 
quantum query complexity. In Section [5] we will show how to implement this algorithm time- 
efficiently. 


Theorem 3.10. There exists a quantum algorithm that solves the EGGT problem with parameters 
k and d using 0(>/l + k/d) queries. 

Proof. We prove the theorem by constructing a feasible solution to the semidefinite program Q 
and then applying Theorem |2.4[ For the EGGT problem, the adversary bound reads as follows: 


minimize max > XslA,AJ 

Aexuy^sc[n] ^ 


(11a) 

subject to ^2 Ws'|14,5] = 1 

for all A G X and B e T; 

(lib) 

S: An5=0 xor Bns=<6 



Xs h 0 

for all 5 C [n]. 

(11c) 
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Our feasible solution to © is an adaptation of the solution given in |10| for the task of finding 
the subset A. It is possible to give a solution to © in the style of |10] |. However, below we give a 
more direct construction resulting in matrices Xg of rank 1. 

Clearly, we may assume that n > k + d. Let S C [n], and s = |Sj. Ifs = 0ors>ra — k — d+1, 
we define Xg = 0. Ifl<s<n — fc — d+1, we define Xg = , where -0 is a vector indexed by 

sets A G X U y with entries 


a s , if A e X and A n 5 = 0; 
iffA} = < (3 Sl if A E y and \ A n S\ = 1; 
0, otherwise. 

Here a s and (5 S are some positive real numbers satisfying 

a s /3 s = f(n - k)( H $ ^ 1 


( 12 ) 


-i 


(13) 


The values of a s and /3 S depend on d and will be chosen later in order to minimize the value of the 
objective function (11a). Thus, ignoring repeated and zero entries, Xg is essentially a 2 x 2 block 
matrix of the form 

a 2 s a sfis 

,a s fi s fi 2 s 


X S = 


The proof now follows from the two claims below. 


□ 


Claim 3.11. The matrices Xg form a feasible solution to (lib) for the EGGT problem and any 
value of d. 

Proof. Fix A € X, B € T, and let i = \B \ A\ > d. Note that XgJH, BJ = 0 if the condition on S 
in the sum in (lib) is not satisfied. Next, 

n—k—l+l £^n—k—i\ ^ 


E x siA,m 


i 


E 


n — k tn-k- 1 \ n — k 

SC[n] s=l V s —1 ) 

where, for non-negative integers a <b, we define 


T(n — k — £,n — k — 1) , 


a ala — 1) a(a — l)(a — 2) • • • 1 

T{a,b) = l + - + gfi - -L + ...+ y A ; 


b 6(6-1) 


6(6 — 1)(6 — 2) • • • (6 — a + 1) 


Thus, to show that X)sc[r?,l x slA, BJ = 1 it remains to show that T(a,b) = (6 +1)/(6 — a + 1). This 
is easy to check by induction on a: the case a = 0 is trivial, and for the inductive step we have 


„. a m . a f b \ 6+1 

T(a,6) = l + -T(a-l,(,-l) = l + + ^-^ TT j=^-^ TI 


□ 


Claim 3.1 2. For each d, there exists a choice of a s and /3 S satisfying (13) such that the objective 
value (11a) is 0{^J 1 + k/d). 
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Proof. Fix a positive integer s < n — k — d + 1. For all A € X and B E T, we have 

— k N 


and 


E X S \A,A\ 

SC[ra]: |Sj=s 

E X s [B,B} = {k + d) 

SC[n]: |5|=s 




n ~ k ~ d \o2 

s- 1 1 Ps ' 


( 14 ) 


(15) 


We take a s and (3 S so that the values of (14) and (15) are equal. In particular, they are equal 
to their geometric mean, which, by ([13]), is 


v , (' i i t )(*+rf)C7fr') ^ [TTd 


s(n — k ) 


where we used that ( n ~ k ) = (E-E) an< ^ 

{k + d)( n ~ s k f d ) k + d (n-k- s W_1 


1 - 


s — 1 
n — k — 1 


d -1 


n—/c—1\ — 


(n-^nh 1 ) 


n — k \n — k — 1 


k + d 
n — k 


1 - 


s — 1 
n — k — 1 


d—1 


Let us denote m = n — k — 1. Using (16), we get that for all H G T U T: 

n—fc—d+1 

E*s[A4i= e E E 

SC[n] SC[n]:|S|=l s=2 5C[n]:|5|=s 

m+1 


< 


k + d 1 v 

m ^^ 


fc + d 


d m 


s=2 


(s — l)/m 


1 - 


s — 1 


m 


d— 1 


< 


k + d 


d 


+ 


r k + d 
lo V P 


(1 — p) d ~ l d p 


+ VkTdB(l/2, (d+ l)/2) = O , 


(16) 


where B stands for the beta function. Here, we substituted p = (s — l)/m, used monotonicity of 
the function p _1 (l — p ) d_1 , and applied a well-known asymptotic for the beta function. □ 


From (12) we have the following important observation. 

Observation 3.13. The feasible solution to the adversary bound © constructed in the proof of 


Theorem 3.10 has the following irrelevant variables in the sense of Proposition 2.11 


• If the input A is in X (i.e., |H| = k ), an input variable S C [n] is irrelevant if S n A 0. 

• If the input A is in y (i.e., \A\ = k + d), an input variable S C [n] is irrelevant if |5n A\ / 1. 


This observation will be used in Section [4j when we compose the algorithm of this section with 
an influence tester to get a query-efficient junta tester. In Section [5j we will show that the same 
algorithm also solves the GGT problem. 
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4 Quantum algorithm for junta testing 

The aim of this section is to prove the following theorem: 


Theorem 4.1. There exists a bounded-error quantum tester that distinguishes k-juntas from func¬ 
tions that are e-far from any k-junta, with query complexity 

O ^ \Jkfe log A:) . 

Suppose /: (0, l} n —> {±1} depends on a set J C [n] of K variables. Thus, the promise is that 
either / is a A-junta ( I\ < k), or / is e-far from any A-junta (we will call such / a “non-junta” for 
simplicity). The goal of the tester is to distinguish these two cases. 

At the lowest level of our algorithm, there is the following subroutine. 

Lemma 4.2 (Influence Tester). There exists an algorithm that, given a subset V C [n], accepts 
with probability at least 0.9 iflnfy(f) > 6 and rejects with certainty if Infy (/) = 0. The algorithm 
uses 0(^/l/S) queries and 0(n/\fS) other elementary operations. 

Proof. We pick x, y randomly as described in Section I below Q, and check if /(x) / /(y). By 
applying amplitude amplification for 0(l/\f&) rounds to amplify the basis states where /(x) ^ f(y), 
we obtain the lemma. □ 


The idea is to run the EGGT algorithm of Theorem 3.10 with the Influence Tester of Lemma 4.2 
as the input oracle. The complication is that we do not know what value of 5 we should specify: 
the Fourier weight of a non-junta can be either concentrated on few (though more than k ) variables 
with large influence, or scattered over many variables with tiny influence, and these cases call for 
different values of 6. We identify roughly log A: different types of non-juntas, and design a separate 
tester for each of them. A junta will be accepted (with high probability) by all of these testers, 
whereas a non-junta will be rejected by at least one of them. The description is given in Algorithm[lJ 
followed by the definition of the different types of non-juntas. 

Let us describe these types of non-juntas. For notational convenience, assume the first K 
variables are the influential ones, ordered by influence (of course, the tester does not know this 
order): 

Infi(/) > Inf 2 (/) > • • • > Inf A -(/) > 0 = lnf K+1 (f) = ■■■ = Inf„(/). (17) 

Our tester does not know the number K — k of “extra” variables if / is far from any A-junta. 


However, Lemma 2.14 implies that 


I n f{fc+i,...,#}(/) > £• 

Our tests are tailored to the following cases: 


(18) 


1. )Cj=°/f+i Infy (/) > e/2. This case is additionally split into |_log(200/c)J + 1 subcases: 


j e [n\ 


Inf;(/) > 


>k + T 


2^+3 log(400fc) 

Such an / is of the first kind , for this value of i. 


where i G (0,..., [log(200fc)J}; 


2. Y^f=k+\ Infy (/) < e/2. Such an / is of the second kind. 
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Algorithm 1 Quantum Junta Tester 

Input: Integer k> 0, real 0 < e < 1, and oracle access to /: {0, l} n —> {±1} 

Output: Accepts if / is a fc-junta; rejects if / is e-far from any fe-junta. 

1. Accept if all of the following [log(200fc)J + 2 testers accept, reject if at least one of them 
rejects: 

• Tester of the first kind with i E {0,..., [log(200fc)J}; 

• Tester of the second kind. 

Subroutine 1.1 Tester of the first kind 
Input: Integer £ > 0. 

Output: Accepts if / is a fc-junta, rejects if / is a non-junta of the first kind 
with this value of £. 

1. Run the EGGT algorithm of Theorem 
oracle: 

• On input S C [n], run Influence Tester on V = S and 5 = e/(2 £+3 log(400fc)). 

2. Accept if the EGGT algorithm accepts, otherwise reject. 

Subroutine 1.2 Tester of the second kind 

Output: Accepts if / is fc-junta, rejects if / is a non-junta of the second kind. 

1. Estimate the acceptance probability of the following subroutine with additive error .05: 

• Generate V C [n] by adding each i to V with probability 1/k independently at random. 

• Run Influence Tester with this choice of V and 5 = e/(4fc). 

2. Accept if the estimated acceptance probability is < 0.8, otherwise reject. 


3.10 with parameters k and d = and the following 


Note that / may be a non-junta of the first kind for many different values of £ simultaneously; 
an extreme example is if / is the n-bit parity function. 

Lemma 4.3. Every non-junta f satisfies at least one of the cases above. 

Proof. It is clear that any / satisfies the first or the second case above, so the only thing we need 
to show is that the first case is fully covered by its [log(200A:)J + 1 subcases. Assume / satisfies 
the first case. Denote e' = e/(8 log(400fc)) and consider the following intervals, which together 
partition the interval [0,1]: 

A = [ 0 g/ ^ 

00 ’ 2 L lo s(200fc)j J > 

where £ runs from [log(200A:)J to 1. Let 

Be = {j E {k + 1,..., 200/c} | Inf j (/) E Ae }. 

Each j is included in exactly one of the B(. Let also We = YljeB e Inf j (/) - Thus, We > e/2, 
because we are in the first case. Next, W, x < 200 k ■ e/(8 • 2L lo g( 200fc )J) < e/4. Thus, there exists 


2 <” 2 «-! 


A> — [A l] t 
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t E {0,..., |_log(200fc)J} such that Wg > e/(41og(400 k)). Then, either £ = 0 and \Bg\ > 1, or 


\B(\ > Wg 


2^+2 l O g(400fc) 


> T 


Also, all j E satisfy Infj(/) > e/ (2 £+3 log(400&)). By (17), all j E [A:] also satisfy this inequality. 
This means that / satisfies the first case with this value of £. □ 

Lemma 4.4. For each £ E {0,..., [log(200A)J}, Subroutine M accepts if f is a junta, and re¬ 
jects if f is a non-junta of the first kind for this value of l. Its query complexity can be made 
0(\/(kje) log k). 


Proof. The composition in Subroutine 1.1 is understood here in the sense of Definition 2.10 with 
the functions F and G defined as follows. 

Given a function h: 2^ —> 


The partial function F is the EGGT function from Definition 


3.1 


{0,1}, F(h) = 0 if h = Intersects^ with |A| = k, and F(h) = 1 if h = Intersects^ with \A\ = k + d, 
where Intersects^ is defined in Q. In all other cases, the value F(h) is not defined. 

For each S C [n], the partial function Gs is as defined in Lemma 4.2 Given a total function 
/: {0, l} n -4 {0,1}, define G s (f) = 0 if Inf s (/) = 0 and G s (f) = 1 if Inf s (/) > <5. If 0 < Inf 5 (/) < 
5, the value Gs(f) is not defined. 

The function evaluated in Subroutine 0 is the following restriction of the composed function 

F o (GQ, G{!}, G{2}, ■ ■ •, G[ n ]): 

/ i y F(G 0 (/), G {1} (/), G {2} (/),..., G [n] (/)), 

where, as the arguments of F, we have all possible 2 n functions Gs- The composition here is 


understood as in Definition 2.10 with the irrelevant variables of F given by Observation 3.13 


The query complexity of the s ubrou tine can be computed using Corollary |2.12 The complexity 


3.10 


is OU/kJ¥), as = O(k). The quantum query complexity 


4.2 


Thus, the total query complexity of the subroutine 


of the algorithm for F in Theorem 
of each Gy is G(y {2 e /e) log k) by Lemma 
is 0(y/(k/e) log k). 

Let us prove the correctness of the subroutine. Assume / is a non-junta of the first kind with 
this value of l. By definition, there exists A C [n] of size k + d such that for all j E A, Inf j(f) > 5. 
As the influence is monotone in S, Inf s(f) > <5 for all S that intersect A. By Observation |3 .13 all 
input variables S satisfying S n A = 0 are irrelevant, hence the value of the composed function is 1 
in this case. 

On the other hand, if / is a junta, there exists A C [n] of size k such that for all S C [n] 


satisfying S O A = 0, we have Inf5(/) = 0. By Observation 3.13 all input variables S satisfying 
Snd/0 are irrelevant, hence the value of the composed function is 0 in this case. □ 

From the proof of Lemma |4.4| it is clear why we need a separate tester for the second case. 
If 2 £ becomes u>(k), the complexity of Influence Tester still grows as 0(2 i /e), whereas the EGGT 
algorithm cannot use fewer than 0(1) queries. Our second tester (Subroutine |1.2[ ) does not use the 
EGGT algorithm, and relies on more traditional means. 


Lemma 4.5. Subroutine \1.S\ accepts if f is a junta, and rejects if f is a non-junta of the second 
kind. Its query complexity is 0{y/k/e). 
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Proof. The estimate of the query complexity of Subroutine 1.2 is straightforward. Let us prove 
its correctness. We will show that the inner procedure has acceptance probability < 0.75 if / is a 
/c-junta, and acceptance probability > 0.85 if / satisfies the second case. 

If / is a /e-junta then the probability that the set V does not intersect with the set J of (at 
most k) relevant variables is: 

(1- l/fc)l J l > (1-1 /k) k > 1/4, 

assuming k > 2. If V and J are disjoint, then the algorithm always rejects, hence, the acceptance 
probability is at most 0.75. 

Now suppose / is a non-junta of the second kind. For notational convenience, we still assume 
that the variables of / are ordered by decreasing influence as in ©• For j £ [n], let us define 


M,-(/) = 


0, 


if j < 200 k; 
otherwise. 


l^Es: 5n{200fc+l,...j}={j} ■ 

For S C [n], define Inf s (f) = Ej 6 sMj(/). 

This quantity satisfies two important properties. First, 0 < Inf s ( f) < Inf g(/) for all S C [n]. 
And second, it is additive is S, i.e., Inf g, JT ( f) = Inf <?( f j + Inf r ( f) for all disjoint S and T. Note 
that Inf s(f) is only subadditive in S. 

Next, as / satisfies the second case, Infj(/) < e/(200 k) for j > 200L. Hence, Infj(/) < e/(200fc) 
for all j £ [n]. Finally, 

200fc 

Inf [n](/) = Inf{200fe+l,...,X}(/) > Inf{fc+ 1 ,...,A'}(/) - 1 nf J'(/) ^ \ • 

j=k+l 


Consider the random variable Inf y( f) where V is as in Subroutine 1.2 Its expectation is 


l J - = E[Infy(/)] = ^Inf[„](/) > ^ , 


2k 


and its variance is 


a 2 = Var[Infy(/)] < ^ ^ nmxlnfj(/) ■ M[„](/) < 

j 


g < El 
200A: ^ “ 100 


Then, Chebyshev’s inequality implies 

Pr ) < e/4 k\ < Pr[|Infy(/) “ lA > A 1 / 2 ] < Pr[|Infy(/) - lA > 5cr] < ^ = 0.04 . 

Hence, with probability at least 0.96, we have Infy(/) > Infy (/) > e/4fc. If this is indeed the 
case, the influence tester in Lemma 4.2 accepts with probability > 0.9. Thus, the inner procedure 
accepts with probability at least 0.96 • 0.9 > 0.85 if / satisfies the second case. □ 

From Lemmas 4.4 and |4.5[ it is easy to see that Algorithm [l] is correct. If / is a junta, then all of 
the 0(log k ) subtesters accept. If / is a non-junta, then at least one of them rejects (and the output 
of the remaining ones is not defined). Thus, our algorithm is of the “robust conjunction” from ([5]). 
Hence, using Example 2.13| and Corollary 2.12| we get that the query complexity of Algorithm [I] is 

0(y/log k ■ y/(k/e)\ogk) = O[\fkje log k). 

This concludes the proof of Theorem |4.1| 
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5 Efficient implementation 


The main aim of this section is to prove that the algorithm from Theorem 3.10 can be implemented 
time-efficiently. Here by “time” we mean the total number of gates the algorithm uses, both the 
query-gates and all elementary quantum gates (from some arbitrary fixed universal set of gates) 
used to implement the unitaries in between the queries. 

Moreover, we will prove that our algorithm computes a function that has irrelevant variables 
as specified by Observation 3.13. For clarity, we will now explicitly describe the problem which 


arises from applying Definition 2.9 to the EGGT problem of Definition 3.1 


Definition 5.1 (QGGT). In the quantum gap group testing (QGGT) problem with parameters k 
and d, one is given access to an oracle Of satisfying the following properties. The oracle Of acts 
on two registers: the re-qubit input register I, and an arbitrary internal working register W. The 
oracle is in the block-diagonal form Of = ® 5C { 0 i}" 0/,S; where Of t s is a unitary operator on W, 
that gets invoked in Of when the value of the register I is S. We are promised that Of belongs to 
one of the following two families: 


am 




X = {Of I 3A £ X VS C [re] : S n A = 0 = 

=k Of t s 0)w = 0)w} 

(19) 

y = {Of \ 3B £ y VS C [re] : s n B ± 0 = 

> O/,s|0)w = - 0)w} • 

(20) 


The task is to detect whether Of £ X or Of € y. 

Theorem 5.2. There exists a quantum algorithm that solves the QGGT problem with parameters 
k and d in time 0(rei/l + k/d) using 0(y/l + k/d) queries. 

The time complexity of the algorithm is roughly re times its query complexity; as mentioned in 
the introduction, this is probably the best one can hope for. 

Note that the QGGT problem incorporates the usual quantization of the GGT problem from 


Definition 3.2, thus we have the following: 


Corollary 5.3. There exists a quantum algorithm that solves the GGT problem with parameters k 
and d in time 0(ny/l + k/d) using 0(^/1 + k/d) queries. 

However, the QGGT problem is more general than the GGT problem, the difference being that 
Of^s may be an arbitrary unitary in W when the premises in (19) or (|20l) do not hold. 

With Theorem 5.2 in hand, it is easy to show that Algorithm [T] can be implemented time- 
efficiently as well, with a slight increase in the number of queries. 

Theorem 5.4. There exists a bounded-error quantum tester that distinguishes k-juntas from func¬ 
tions that are e-far from any k-junta in time 0{n\fkje) using O^y/k/e) queries. 


Proof. It is easy to see from the proof of Lemma |4.5| that the time complexity of Subroutine 1.2 


is Oiny/k/e ). Unfortunately, it is hard to estimate the time complexity of the implementation 
of Subroutine m in Lemma |4.4[ because Lem ma |4.4| invokes Proposition |2.11[ b) to analyze the 
composition of quantum algorithms (Proposition 2.11 'b) upper bounds the query complexity of the 


5 One can also weaken the premise S 0 B 7 ^ 0 in ( |20| ) to 15 n B\ = 1. We chose this definition to make the QGGT 
problem more similar to the GGT problem. 


24 





















composition but not its time complexity). However, Algorithm [I] can be implemented to have time 
complexity 0(ny/k/ej as follows. 


We first reduce the error probability of each call to the Influence Tester of Lemma 4.2 to <C e/k 
by 0(log |) repetitions, and run it backwards (after copying the answer) to set the workspace 
back to its initial state; then run the QGGT algorithm on this oracle as if it’s errorless. Standard 
techniques show that the resulting variant of Subroutine 0 can be made to have error probabil¬ 
ity £ 1/3, and we do not need to invoke Proposition |2.1l) (b) anymore. The query complexity of 
Subroutine 1 1.1 1 has now gone up by a factor 0(log |), but its time complexity becomes 0(ny/k/e ), 
because it is the time complexity of the QGGT algorithm, plus its oracle-query complexity multi¬ 


plied by the time complexity of the amplified Influence Tester of Lemma 4.2 that implements one 
oracle call. 

The resulting variant of Algorithm [l] can thus be implemented in time 0{nyjk/e). □ 

5.1 Proof of Theorem 15.21 


It remains to prove Theorem 5.2 This is done by a (by now relatively standard) implementation 
of the dual adversary bound as in [48|. The analysis follows f32|), with the simplification that we 


have Boolean input and output (see also [9[ Section 3.4]). Our main innovation here is an efficient 
implementation of a specific reflection in Section |5.2| which we do by means of a new and efficient 
quantum Fourier transform. 

Before we proceed with the algorithm, 


Recall the QGGT problem as defined in Definition 5.1 


we have to make a minor modification to this definition. Due to technical reasons in Claim | 
below, we have to assume that Of t s not only satisfies (19) or (20), but is also a reflection. This 


is without loss of generality. Indeed, assume Of satisfies the original constraints of Definition 5.1 


Add a new basis element |1) to W, and assume that 0/,s does not change it. Let V be a unitary 
on W that maps |0)w to ^(|0)w + |l)w)- Denote by O'j- the following chain of operations: apply 

O/V, reflect about ^(|0)w + |l)w)> and apply V~ 1 Oj 1 . It is clearly a reflection, and it has the 
same decomposition O'j = ©sc{o i} n ®'f s as Of does. Moreover, it is straightforward to check 

that if Of is in X or y , then O'j is also in X or T, respectively. 

Our algorithm only uses the input register I, so we omit this subscript below. The register W 
is not written, but assumed to be in the state |0)w- We also add a new basis state |0) to I, and 
assume that Of |0) = —10) for all Of. 

The query-efficient algorithm in Theorem |3.10| was obtained by constructing the matrices Xg 
in (12) that depend on para meters a s and j3 s satisfying (13). The objective value (11a) is W = 
0(^1 + k/d ) by Claim 3.12 Also we define 7 = C\\/W for some constant C± to be determined 


later. 


Let A be the projector onto the span of the vectors 


n-k—d -\-1 


ipA= |0)+7 ^ 


a. 


E 


15 ) 


( 21 ) 


S=1 


SC[n]: |S|=s, SnA=0 


over all A € X, and R\ = 2A — I be the corresponding reflection. (In Section 5.2 we show how to 


implement R\ efficiently.) The QGGT problem is solved by Algorithm [2j where C is some constant 
to be defined later. 


Claim 5.5. Algorithm^is correct. 
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Algorithm 2 
Input: 
Accepts: 


Quantum Algorithm for the QGGT problem 
access to an oracle Of € X U y as in (19) or (20). 
iff 0/ 




G X. 


1. Prepare the state |0). 

2. Perform phase estimation on the operator U = OjR,\ with precision <5 = 1/(CW). 

3. Accept if and only if the phase-estimate is greater than 6. 


Proof. Let us first assume that Of G y. Let B e y be a corresponding element from (20), so 
\B\ = k + d. Define the following vector 


n—k—d-\-l 

« = 7io)- Y p* Y i 5 )- 

s=l SC[n]i |S|=s, |SOB[=l 


( 22 ) 


The squared norm of this vector is 

n—k—d +1 


I 112 2 

\u\\ = 7 


It — rc — Li-f L / 7 

+ Y. ( fc + d )( n ~_i ) /3 , 2 = 7 2 + E XslB,B]<C*W + W, 

s= 1 V s / 


where the second equality uses (15), and the last inequality uses that the objective value (11a) 
is W. 

We now show that u is an eigenvector of U = OfR\ with eigenvalue 1 (so the eigenvalue’s phase 
is 0). First, OfU = —u, because Of\S) = —(S') for all S occurring in (22) and we earlier already 
assumed Of\0) = — 10). Second, for all A E X we have 


n—k—d+1 

(Vt4,w) = 7-7 Y Y a. s f3 s = 7-7 Y x slABj = 0, 

S=1 SC[n]:|S|=s,SnA=0,|SnB|=l S: AnS=<D xor BnS=0 


where we used (12) and (lib). Hence, A u = 0 and R\u = (2A — I)u = —u. Therefore, Uu = 
OfR\u = u. 

Furthermore, the inner product of the normalized eigenvector u/||it|| and |0) is 


7 


> 


CiVw _ i 

s/C'(W + W - y/l + l/C‘( ' 


which can be made arbitrarily close to 1 by setting C\ to a sufficiently large constant. Since |0) 
is the starting state of Algorithm [2j the algorithm will (with probability at least 2/3 if we set C\ 
appropriately) produce a phase estimate that is at most 5, and correctly rejects Of G y. 

Now assume Of G X. Let A € X be the corresponding element from (19), so |A| = k. In this 
case, we will apply Lemma 


2.3 


with i?i = —R\ = I — 2A, R 2 = —Of (hence U = OfR\ = R 2 R 1 ), 
n x = / - A, n 2 = (/ - Of)/ 2, and w = if a- Indeed, since Aw = w, w lies in the kernel of Hi, 
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and we assume Of is a reflection, so the conditions of the lemma are satisfied. We have H 2 W = |0) 
because Of |0) = — 10), and Of\S) = (S') for all S in the support of if a- Also, 


n—k—d+l 


2 = 1 + 7 2 


E 

S =1 


n — k 


a 2 = 1 + 7 2 W = 1+CfW 2 . 


Since also W = £1(1), we have ||m|| = 0{W). Therefore, using Lemma |2.3[ the algorithm’s initial 
state |0) barely overlaps with eigenvectors of U = R 2 R 1 whose phase is (25)-close to 0: 

11^10)11 = ||p 2 *n 2 H| < = o ( i / c ). 

Hence the probability that phase estimation erroneously yields an estimate that is 5-close to 0 can 
be made less than 1/3 by choosing C a sufficiently large constant. Then Algorithm [ 2 ] accepts all 
Of E X with probability at least 2/3. □ 

As R\ can be implemented without exec uting the input oracle, the query complexity of Algo¬ 
rithm's 0(W) = 0{y /1 + k/d) by Theorem |2.2| To get the time complexity, the query complexity 
has to be multiplied by the cost of implementing U = OfR\. In Section 5.2 we show that R\ can 
be implemented in time O(n). Thus, Algorithm [ 2 ] can be implemented in time 0(n\J 1 + k/d). 

5.2 Efficient implementation of R\ 

This section is devoted to the proof of the following lemma, which shows that the reflection R\ = 
2A — I can be implemented efficiently, in time 0(n). For simplicity we assume n > 2k. This is 
without loss of generality, as we can extend the set [n] with dummy elements. Next, we identify 


|0) of Eq. (21) with |0), and absorb 7 into a s . To state the lemma, it is also more convenient to 


replace A in Eq. © by its complement, T = [n] \ A. 

Lemma 5.6. Let ao, oq,..., a n -k be arbitrary complex numbers and let A be the projector onto 
the span of the vectors 


n—k 


i’T = E at 


E i B > 

t=0 BCT: \B\=l 

over all T C [n] with |T| = n — k. Then, the corresponding reflection R\ = 2A — / can be 
implemented in time 0(n), up to an error in the operator norm that can be made smaller than any 
inverse polynomial in n. 


Representation theory background. In order to prove Lemma 5.6, we will use the structural 


properties implied by the invariance of the vectors t/t under permutations of [n]. We need some 
basic results from the representation theory of the symmetric group. These results are only used 
in this section. The reader may refer to a textbook on the topic such as [50|, or to the appendix 
of [10] . where we briefly formulate the required notions and results. 

Let § n denote the symmetric group on [n]. We consider (left) modules over the group algebra 
CS n . We call them S n -modules; they are also known as representations of S n . There is a 1-1 
correspondence between irreducible 8 n -modules and partitions (t±,...,tk) of n (where ti > t 2 > 
• • • > tk and t\ +12 + ■ ■ ■ + tk = n). Irreducible § n -modules are called Specht modules. 

A linear mapping 9: V —> W between two § n -modules is called an § n -homomorphism if, for all 
7T E § n and v E V, we have 9(nv) = 7 r(9(v)). The following result is basic for such homomorphisms: 
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Lemma 5.7 (Schur’s Lemma). Assume 6: V —> W is an §> n -homomorphism between two irreducible 
S n -modules V and W. Then, 6 = 0 ifV and W are not isomorphic. Otherwise, 6 is uniquely defined 
up to a scalar multiplier. 

Let M denote the complex vector space with the set of subsets of [re] as its orthonormal basis 
and with the group action ttA = ir(A), where ir e S n , A C [re], and it (A) denotes the image of the 
set A under the transformation n. We call {A}^c[n] the standard basis of M. 

The module M naturally decomposes into a direct sum M = ©?= _q Mg, where Mg is spanned 
by the subsets of cardinality The following lemma describes the decomposition of Mg into 
irreducible submodules Sg(t) (for different values of t), which will be isomorphic to the Specht 
module 5(f) corresponding to the partition (re — t,t) of n. 

In the formulation of the lemma and later we use 0 to denote disjoint union of subsets of [n], 
extended by linearity, so for example ({1} — {2}) ® ({3} — {4}) = {1,3} — {1,4} — {2,3} + {2,4}. 

Lemma 5.8. The § n -module Mg has the following decomposition into irreducible submodules: Mg = 
®£ =0 S}(t), where H! = min {£,n — £}, and each Sg(t) is isomorphic to 5(f). The submodule Sg(t) is 
spanned by the vectors 


vg{t , a, b ) = ({ai} - {&i}) <8> ■ ■ ■ <8> ({a t } - {b t }) 0 


E 


A 


AC[n]\{ai,...,at,6i,...,6t}: \A\=£—t 


(23) 


defined by disjoint sequences a = (ai,...,at) and b = ( 6 i,..., 6+) of pairwise distinct elements 
of [n]. The dimension of 5(f) is (”) - (^J. 

There is a unique (up to a scalar) S n -isomorphism between Sg(t) and S m (t). We can choose the 
scalar so that the isomorphism maps each vector vg(t, a, b) to the corresponding v m (t,a,b). 


The lemma follows from general theory [ 501 Sections 2.9 and 2.10]. The Appendix of JTOj contains 
a short proof, see also Remark 5.12 below. Figure [l] depicts the different subspaces involved in the 
decomposition of M. 



Figure 1: Decomposition of M 

6 In terms of [50], Mi is isomorphic to the permutation module corresponding to the partition (n — £',£') of n, 
where £' = min{f, n — £}. 
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Let V£(t,a,b) denote the normalized vector vg(t,a,b)/\\ve(t,a,b)\\, that is 


ve(t,a,b ) 



to — 2 A 
£~t) 


ve(t,a,b). 


( 24 ) 


Also, let St{t) —>• S^t) denote the isomorphism from Lemma 5.8 given by vt(t,a,b ) i-T- 

ve(t,a,b). This is a unitary transformation. 

For each t, we choose an orthonormal basis {et(t,x)} x of St(t). Also, let ee(t,x) = et(t,x), 
so that {eg(t, x)} x is an orthonormal basis of Sg(t). The precise choice of the basis of St(t) is irrele¬ 
vant, but it is important that the bases of Sg(t) for various l are synchronized via the isomorphism 
flt-tt- The set {ee(t,x)}i t x forms an orthonormal basis of M, which we call the Fourier basis. Let 
R(t) denote the submodule ©”=/ S^(t) of M. 


Claim 5.9. In the Fourier basis, any S n -homomorphism from M to itself is of the form 

Is(t)} where A t <8> I s ^ acts on R(t), A t is an (to — 2t + 1) x (to — 2t + 1) matrix, and I^u) denotes 

the identity operator on S(t). 


Proof. By Schur’s lemma, any § n -homomorphism maps each irreducible module to an isomorphic 
one. Hence, each R(t) is mapped to itself. Also, as is the only isomorphism between St(t) and 
Sg(t), we see by Schur’s lemma that a vector eg(t,x) is mapped into a linear combination of the 
vectors {e m (t, x)} m . Thus, in R(t), the homomorphism has the form A t (g) I sit) for some A t . □ 


The quantum Fourier transform of M. Let the register A have M as its vector space with 
the standard basis. Let also T and L be (to + l)-qudits (i.e., registers of dimension to + 1), and let 
n-qubit register B store indices x of the Fourier basis elements eg(t,x). The Fourier transform of 
M is the following unitary map, for which it is convenient to use ket notation: 


F: |t)xK)i-|z)B | ee(t,x)) A . 


(25) 


Note that while F is a unitary from a 2 n -dimensional space to a 2 n -dimensional space, it is conve¬ 
nient to use more than to qubits to represent the basis states (namely n + 2(log (to + 1 )] qubits). 

The Fourier transform segregates copies of non-isomorphic Specht modules of M by assigning 
them different values of t in the register T. For each Specht module S(t), the copies are labeled 
by £, which is the size (as a subset of [to]) of the standard basis elements of M used by the copy. 
Finally, the basis elements of S(t) are indexed by x, the precise choice of which is irrelevant for our 
application [[] The next theorem, which we prove in Section 
efficiently. 


5.3 


shows our QFT can be implemented 


Theorem 5.10. There exists a quantum algorithm with time complexity 0(n ) that implements the 
map from (25) for all choices of t, l and x for which the last expression is defined, up to an error 
in the operator norm that can be made smaller than any inverse polynomial in to. 


'For readers familiar with the usual formula of the QFT over S n , given by \g) yj^ffl/ 0 ) ffi j P(s)*j|®)li); 

(where g £ §„, p runs over all irreps of S n , and d p is the dimension of p): the register T in our algorithm is similar to 
p, L is similar to i, and B to j. The formula, however, does not exactly apply in our case, because we are performing 
QFT of the module M, not the regular S n -module, for which the latter formula applies. In other words, our QFT acts 
on the 2 n -dimensional space spanned by the subsets of [to], not the n!-dimensional space spanned by all permutations. 
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Decomposing A in terms of representations. The main observation behind our implementa¬ 
tion of R\ is that A is invariant under the permutation of elements, hence it is an § n -homomorphism 
from M to itself, and thus subject to the decomposition of Claim 5.9 In fact, it is not hard to 
obtain the matrices At in this decomposition. 

Claim 5.11. The image of A contains at most one copy of each S(t ) for t = 0, ..., k. In the 
Fourier basis, A has the following form: ©t=o0 &tWt)®I S (t), where wt is the normalized version of 
the (n — 2t + 1)-dimensional vector wt that is given by 


wtiq = at 


n — £ — t 
k — t 



(26) 


for l E {t ,..., n — k}, and wt\F\ = 0 for I E {n — k + l,...,n — t}. (If wt is the 0-vector, we assume 
that wt is the 0-vector as well.) 


Proof. We first assume that a n -k ^ 0. Because A is an § n -homomorphism, by Claim 5.9, A has the 


form 0 f A t ®Is(t) in the Fourier basis. We claim that A contains exactly one copy of each S(t) with 
0 < t < k, i.e., all A t are rank-1 projectors: indeed, the projection of A on M n _k is surjective, and 


as we know from Lemma 5.8, the latter does contain a copy of each S{t). Since S(t) has dimension 
(") — it- 1)> the direct sum of these copies of S(t) already has dimension X^=o (Cs) — (t-i)) = (&)■ 
On the other hand, A clearly has dimension at most Q(), hence A cannot contain more than one 
copy of any of these S(t). Thus, A actually has the form 0^ =o (wtWt ) ® Is(t) for some vectors wt- 
It remains to hnd the coefficients of Wf Take a vector v n -k(t,a,b ) E M n _k for some sequences 
a and b, and act on it with the linear transformation that maps a basis vector T E M n _k to the 
vector At E Af. The resulting vector u is clearly in the image of A. We claim that 


n—k 


U = 


i=t 


n — t — t 
k — t 


n—k 


ve(t,a,b) = S ^oit 


i=t 



2t (” _ ( ( 2T ) 


To prove the first equality of (27), consider a basis element A E A It for an I E {t,... ,n—k} and look 


at its coefficient in u. If A contains both a, and bi for some i, it appears in none of the "i/’r of which 
u consists. If A avoids both a* and bi for some i, then any coefficient it gets from ifr is cancelled 
by the coefficient it gets from ^rA{ai,6,;}) where A stands for symmetric difference. Finally, if A 
uses exactly one of each a*, bi, it appears in ?/V for choices of T, with coefficient equal to 

its coefficient in vt(t,a,b ) times at in each. This establishes the first equality of (27). The second 


equality in (27) follows immediately from (24). 


Now let v = (v x ) be the vector of coefficients of the representation of vt(t,a,b ) in the basis 
{et(t,x)} x , i.e., vt(t,a,b ) = Yl x v -v e t(t, x ) = ^(I^tI^lI^b)■ Then, by our choice of orthonormal 
basis, F~ 1 (vt(t, a , b)) = |t)jK)L1"^)B for I E {t, ..., n — k}, and from (27), we have 


n—k 


F~ l | u 


2t ) = ^2at[ U , ^ ^ \l ( n „ 2 f)F 1 (vt{t,a,b)) = |t) T N)i»B, 


e=t 


k — t 


-t 


where wt is defined in (26). As u is in the image of A and F At is u represented in the Fourier 


basis, wt must be proportional to wt- 
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Now consider the case a n _fc = 0. In this case, change a n -k to an arbitrary non-zero value, 
and perform the above calculations for the resulting projector A'. The result follows from the 
observation that the image of A is a projection of the image of A' onto ®g <n _ fc Mg. □ 


Remark 5.12. Note that (27) essentially proves the second part of Lemma 5.8 


Indeed, it shows 

that the transformation u n _fc(t, a, b) i —> vc(t,a,b) is linear (which is not obvious from (23)). It is 
clear that it is invariant under the action of § n , hence, it is an isomorphism between the copies of 
S(t) in M n -k and Mg. 


Implementing R\. Having the efficient implementation of the QFT of Theorem 5.10 (proved in 
the next section), it is straightforward to implement the translation R\ up to polynomially small 
error. First, we run the QFT of Theorem |5.10| in reverse, and obtain the representation of M in the 
Fourier basis. In this basis, by Claim 5.11 our task boils down to the reflection about the vector wt 
in the register L, where t is the value of the register T. We show how to implement this reflection 
below. The register B is not used in this reflection. After that, we run the QFT of Theorem 5.10 


Reflection about u)g. Reflection about wt is done using standard techniques. We describe the 
transformation for a fixed t, but a key point is that they can all be done in superposition (conditioned 
on t), so that the total time complexity of this step is the same as for a fixed value of t. 

We may assume u>t is a non-zero vector. To reflect about u>t, it suffices to give an 0(n)-time 
unitary Ut that maps \t)i to u>t■ the reflection about wt is then implemented by first applying Uf 1 , 
then reflecting about \t)i (which is easy), and then applying Ut- To implement this map Ut, we 
start in the state |t)i_, and run through all the values of £ from t to n — k. In the course of this 
process, we gradually generate states of the form 


wt{t] |i>L H- \-wt{£-lj \£- 1)l + VR. \£)\_, 


where R = 1 — 
transformation 


is the remaining weight to use. At each step, we perform the 
Vr \£)\_ ^ wt{£\ I-Ol + A r Rf\£ + 1) L , (28) 


I I ^ 

where R = 1 — X^=g|^iM| • To help with this, we keep two registers C and R. In C at step £>t, 
we store the number 

/ n—£—t\ / fn—2t\ 

\ k-t ) V V i-t ) 


Cl = 


\w t \ 


so that wt\£ 1 = olici by (26). In the register R, we store R. Both are stored as floating point 
real numbers with 0(logn) bits of precision. With these values, and knowing eng, the transforma¬ 
tion (28) is a routine polylogarithmic computation. 

The registers C and R are manipulated as follows. We start by computing \\wt\\ and then eg. 
We then upload the precomputed value of eg into C, set £ = t and set R to 1. After each step (28) 
we use 

ci -|_i n — £ — k 


ci 


y^n - £ — t)(£+ 1 — t) 


to replace eg with q + i in the register C, in polylogarithmic time. We also update R (in the 
straightforward way). At the end, when £ = n — k, we uncompute the values of C and R. 
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It suffices to keep track of cg_ up to A\ogn relative bits of precision, for an appropriate A. 
This can be achieved as q is only multiplied by a constant at each step, and is never added or 
subtracted. Then, R is accurate within an absolute error of 0{l/n A ~ l ) (to perform the entire 
reflection, we subtract O(n) values |u)t[i]| = ( aic^) 2 from R , with absolute error 0(l/n A ) in each 
of those values). This means that the transformation (28) produces amplitude for \£)\_ that is within 
of the correct amplitude Since this is achieved for every l and there are 0(n) 

values of £, the state that is produced by our algorithm is within Euclidean distance 0(l/n A ~ 3 ^ 2 ) 
of the correct one. The error analysis for the computation of \\wt\\ at the beginning of the algorithm 
is similar. 

Thus, each step takes time polylog(n), and there are 0(n) steps in total. Computing \\wt\\ and 
ct at the beginning and removal of the final values of C and R at the end also takes time O(n). As 
both applications of the QFT take time O(n) as well, the total running time of our implementation 
of the reflection R\ is 0(n). 


5.3 Quantum Fourier Transform of the module M 


In this section, we prove Theorem 5.10 Our algorithm is essentially the same as the quantum 
Schur-Weyl transform in 0E). However, the algorithm in mm is defined for another group (a 
product of a general linear group and a symmetric group), and we have to verify that the algorithm 
really performs the transformation in (25). Therefore we will give a complete description of the 
algorithm here. 

We will define our algorithm inductively in n, so we add a superscript n to the notations of the 
previous section in order to specify the value of n, e.g., M n , S n (t),v™(t, a , b ) and so on. 


Basis for modules S™(t). We now define the orthonormal basis {e”(t, x)} x of S™(t), for all 
n, t E N such that n > 2 1. As in j5; ( BJ [39] . our choice is a variant of the Gelfand-Tsetlin basis. 
The index x is encoded as a binary string of length n (not all binary strings correspond to basis 
elements though), and the definition of e”(i, x) is as follows. 

First, we define eo(0, 0 n ) as the basis element 0 of Sq(0). For t > 0, we define e^(t,x) in terms 
of e™ _1 (t, x). For that, we first need to understand the interplay between § n - and S n _i-modules. 

Since § n _i is a subgroup of S n , we can look at the action of S n -i on S n (t). Let S n (t)\. § 7l _ 1 denote 
S n (t), considered as an S n _i-module. The branching rule [50] Section 2.8] says that S n (t)\. § 71 _ 1 is 
§ n _i-isomorphic to the direct sum S n ^ 1 (t) © S’ n ~ 1 (t — 1). (If n = 2 1, the S n ~ 1 (t) term is absent in 
the last expression, as n — 1 < 2 1.) 

Let to be the i -isomorphism between 5” _1 (t) and the copy of S’ ri_1 (t) in 5" and 

T\ be the § n _j -isomorphism between — 1 ) and the copy of S n ~ l (t — 1 ) in g n _ 1 . We 

normalize To and ti so that they both become unitary operators. We define ef(t, ®0) = Toe” _1 (t, x) 
and e"(t, xl) = Tie”_ 1 (i — 1, x), where x runs through the basis elements of 5”^ 1 (t) and 1), 

respectively. Thus, {e™(t,x)} x is an orthonormal basis of because to and ti are unitary, and 

their images are orthogonal as vector spaces. 

Let us describe the isomorphisms to and ti explicitly. If n > 2 1, the isomorphism to is given 
by D”^ 1 (t,a, b) i —> v™(t,a,b). It is more convenient to define ti as the § n _i-isomorphism between 
1 (t) and the copy of 5 n_1 (t) in S™ +1 (t+ l)|s„_i- A non-normalized version of the isomorphism 
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is given by 


v t 1 {tj a, b) i—>■ ({ai}“{&i}) 


({«<}-(M) ® ({i} -{»})• (29) 


Indeed, this is a linear combination of linear transformations vf 1 (t,a, b) H > v™ + i(t,a,b) and 
a, b) i->- u” _1 (£, a, b) <8> {n}. It is also clearly invariant under the action of S n _i. r\ is 
the normalized version of this mapping, i.e., it maps a, b) into the normalized vector on the 


right-hand side of (29) 


Identity between basis elements. Recall that e^(t,x) = £t(t,x), where fit-te is defined 


after (24). 


Claim 5.13. We have 

eg(t,x 0) = 


n-i-t 
n — 2t 


e" (t,x) + 


~ t „n—1 


n — 2t 


ef_{(t, x) <g> {n} 


and 


e"(f + 1 , xl) = 


~ t „n—1 I’m n n-i-t 


e e (t, x) - 


e™_{ (t,x) <8 >{n}, 


(30) 


(31) 


n — 2t £ v "’ ' V n — 2t 

whenever the vectors on the left-hand side are defined. (If l = 0, t = t or n = t + £, the right-hand 


sides of (30) and (31) contain only one term.) Here we identify elements S C [n— 1] of the standard 
basis of M n ~ 1 with the corresponding basis elements of M n . 


Proof. To prove (30), it suffices to show that for any two disjoint sequences a = (ai,..., a<) and 
b = (&i, ..., bt) of distinct elements of [n — 1], we have 


i>e(t,a,b) = \j U n 2t 1 ( t ’ a ’ 6 ) + \j n- 21 


as (30) then follows by linearity. The latter equality follows from (23), as a uniformly random 


{£ — i)-subset A of [n] \ {ai,..., at, bi,..., bt} has probability {£ — t)/{n — 21) of including the 
element n. 


To prove (31), let us apply to the right-hand side of (29). The resulting vector is 


({<■!}-{&l}) 


({oi) - {l>t}) ® (l-t) 


E 


A — (n — £ — t ) 


E 


A 


\A\=£—t, n^A 


\A\=£—t, nGA 


because each A ^ n appears once from each term i 6 A of the sum in (29), and each A B n appears 


once from each term i cj A. By an easy calculation, the normalized version of this vector is 


■«?■'(*. o, <-) - b) ® {»}, 


and (31) again follows by linearity. 


□ 
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Algorithm 3 Quantum Fourier Transform of the module M n 

Given: Positive integer n, and registers A, B, T and L as described above. 
Action: Transformation |^)t|^)l|^)b ^ \ee(t,x))&- 


1. While n > 1: 

(a) Let T i — T — B n 

(b) Perform the unitary qubit transformation 


2 . 


|0 )ni " ‘ ' w + JJ-Lv, 

V n — 2t V n — 2t 

on B„, conditioned on the content |t)j|^)L °f the 
t < i < n — t.) 

(c) Swap B n and A n , let L <— L — A n and n <— n — 1. 
Implement the necessary transformation in the case n 


h-» 


l-t 
n — 2t 


| 0 >- 


n — t — t 
n — 2t 


| 1 > 


registers T and L (and on 2 1 < n and 


= 1. 


Quantum Fourier transform. 

of Theorem |5.10 


With Claim 5.13 in hand, it is easy to describe the algorithm 
Both A and B are represented as arrays of qubits Ai,..., A n and Bi,..., B n , 
respectively. A standard basis element S G M n is represented as (S')a, where A,; contains 1 iff 
i G S. A string x = (x,) G {0, l} n is represented as |x)b, where Bj stores x t . The transformation is 
described in Algorithm |3j 

The correctness of the algorithm can be proven by induction on n. The base case n = 1 is trivial. 
Next, for a fixed value of n, the first iteration of the loop in Step 1 performs the transformation 


|*)t|^)l| 0)B n ^ J 9 ~|*)t|1)l|0)a„ + J 9 . I*)t|^- 1>l| l>A n 

V n — It V n — It 

1 1 + l}-r|%|l)B n ^ ~ \J n _ 2t ~ 1 )l| 1}a p 

By the inductive assumption, the remaining iterations of the loop and Step 2 correctly perform the 
Quantum Fourier Transform of M n_1 , hence, the whole algorithm performs the Quantum Fourier 
Transform of M n due to Claim 15.131 

Using standard techniques, each iteration of the loop can be performed with sufficiently high 
precision in time polylog(n). Specifically, the one-qubit transformation for part (b) can be imple¬ 
mented (conditioned on t and i) with sufficiently high precision using polylog(n) gates from any 
universal set of elementary gates. The error (in the operator norm) of this algorithm can be made 
smaller than any inverse polynomial. The second step takes time 0(1). The total time complexity 
of the algorithm is thus 0(n). 


6 Quantum lower bound for junta testing 

Let us assume that e = 0(1). Tight classical lower bounds on junta testing [26) [18j are based 
on distinguishing a fc-junta from a function that depends on k + 0(1) variables. As noted by 
Atici and Servedio [4], this approach is doomed in the quantum setting because these two cases 
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can be distinguished in O(logfc) quantum queries as follows. For a function that depends on only 
k + 0(1) variables but is far from any A>junta, it follows from Lemma 2.14 that at least one of the 
0(1) “extra” variables has 11(1) influence. Hence there exists a subset S C [n] of k + 1 variables 
each having influence 11(1). Each of those k + 1 variables will occur in a Fourier Sample with 
constant probability, so the probability that a fixed variable from S is not seen in t Fourier Samples 
is exponentially small in t. By the union bound, after t = O(logfc) Fourier Samples, with high 
probability all k + 1 variables of S will have been seen and we can conclude the function is not a 
fe-junta. 

Instead of this, Atici and Servedio presented a different approach based on distinguishing a 
/c-junta from a function that depends on k + H(fc) variables. Using this technique, they proved an 
n(\/k) lower bound for a special class of non-adaptive quantum algorithms. 

In this section, we give an explicit description of the Atici-Servedio construction, and use it to 
prove a quantum lower bound for the junta testing problem. Consider the following problem. 


Definition 6.1 (Testing the image size). An image size tester, given oracle access to a function 
g: [m] —> [n], is required to distinguish whether the image of g is of size at most £, or g is e-far 
away from any such function. 


It turns out that a junta tester can be used to solve this problem. The connection is through 
the following ancillary function. 

Definition 6.2 (Addressing function). Assume that m is a power of two, and g: [m] —> [n] is 
a function. We define the corresponding addressing function f: {0, i} n + 1 ogm — > {±1} as follows. 
Interpret the input string x of / as a concatenation yz with y £ {0, l} n and 2 £ {0, l} logm = [777,]. 
Then, f(x) = (— l) y s(. z ). The variables in y are called addressed variables, and the variables in z are 
called the address variables. 


It is easy to see that a quantum query to / can be simulated by two quantum queries to g: one 
to compute g(z), and one to uncompute it. 

Lemma 6.3. For a function g: [m] —> [n] with m a power of 2, let f: {0, i} n + lo g m —> {±1} be the 
corresponding addressing function. Let £ > 1 be an integer and define k = l + logm. If the size of 
the image of g does not exceed l, then f is a k-junta. Conversely, if g is e-far from any function 
with an image of size at most £, then f is s'/2-far from any k-junta where e' = e — (logm)/fc. 

Proof. The first statement is obvious. So assume g is e-far from any function with an image of 
size at most £. We claim that g is also e'-far from any function with an image of size at most k. 
Indeed, if h is a function with image of size at most k, we can reduce its image to be of size at most 
£ by modifying it on at most a (logm)/A’ fraction of inputs corresponding to the “least popular” 
outputs. 

In order to show that / is eY2-far from any A:-junta, take an arbitrary A’-subset W C [n + logm], 
and any Boolean function h depending only on the variables in W. We want to show that / is 
s' /2-far from h. Indeed, by the previous claim, at least e' fraction of the inputs to g map to indices 
outside W n [n]. For any such z £ [m], and any y £ {0, l} n , exactly one x £ {yz,y® 9 ^ z} satisfies 
f(x) / h(x) (where x® 3 stands for x with the jth bit flipped). Hence, the distance between / and 
h is at least e'/2. □ 

Let us now state some corollaries of this result. First, we get an upper bound on the quantum 
query complexity of testing the support size. 
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Corollary 6.4. If login = o(T), the image size can be tested in 0(\J I/e log l) quantum queries. 

More importantly, however, we get a lower bound on the quantum query complexity of junta 
testing. This is based on the following well-known special case of the image size testing problem. 

Definition 6.5 (Collision Problem |21|). Let m be an even integer, and re G N. In the collision 
problem , one is given oracle access to a function g: [m] —> [n], that is either 1-to-l or 2-to-l. The 
task is decide which is the case. 


Brassard et al. [21] constructed a quantum 0(m 1//3 )-query algorithm for the collision problem. 
Later, Aaronson and Shi [1] proved a matching lower bound: 

Theorem 6.6. The bounded-error quantum query complexity of the collision problem is fl(m 1//3 ). 


If g: [m\ —» [re] is 2-to-l, then its image size is m/2 and the corresponding addressing function 
/ depends on only m/2 + logrre variables. On the other hand, if g is 1-to-l, then its image size is 
m and / depends on m + log?re variables. Moreover, it follows from Lemma 6.3 that / is 1/5-far 
from any (m/2 + logm)-junta if m is large enough. Combined with Theorem 


6.6 


we get 


Theorem 6.7. Every quantum tester that distinguishes k-juntas from functions that are 1/5-far 
from any k-junta with bounded error, needs to make 0(L 1//3 ) queries to the function. 


7 Conclusion and open problems 


In this paper we presented quantum algorithms for testing several well known properties of Boolean 
functions. Our main result is a quantum algorithm for the L-junta testing problem with query 
complexity 0(^Jk/e log k), and a time-efficient implementation of this based on a new near-linear 
time implementation of a shallow version of the quantum Fourier transform over the symmetric 
group. The query complexity of our tester is almost quadratically better than the best previous 
quantum tester and also almost quadratically better than the best-possible classical tester. 

The topics for future work include: 


1. Better lower bound for junta testing. The main open question is: what is the actual 
quantum query complexity of this problem? 

We believe that the true answer is around \Jk/e but it is quite chall engin g to improve our 

may give a lower 


6.3 


current lower bound of fl(A: 1//3 ). Nevertheless, we think that Lemma 

bound of ^(A; 1 / 2-5 ) for any 6 > 0. In particular, we think that it should be possible to combine 
the lower bound construction by Raskhodnikova et al. [16] with two recent developments in 
quantum lower bounds: Zhandry’s new machinery for the polynomial method [53], which he 
applied to the collision and the set equality problems [55], and Belovs’s and Rosmanis’s tight 
adversary lower bounds for the same functions m- 


2. Better upper bound. Regarding the upper bound, we wonder if the log k factor can be 
removed. This question is essentially equivalent to finding a solution to the adversary bound 
for the GGT problem that works for all values of d simultaneously. By this, we mean a 
feasible solution to © such that 


£ Xslfj 1 

SC[n] 


0(Vk ), 

o(f), 


if / is an Intersects^ function with |A| = k] 

if / is an Intersects^ function with | A\ = k + d > k. 


36 





Note that our current solution does not satisfy this property because we use different rescaling 
for each value of d. A different approach may be needed to obtain this property. 

3. Other applications of QGGT and our QFT. Several of our algorithms are based on a 
quantum algorithm for a group testing problem, QGGT, which we find quite interesting in 
its own right, as it shows a quartic quantum-over-classical speedup. We think there might be 
more applications for QGGT waiting to be found. 

Acknowledgements. 

We thank Ashley Montanaro for getting some of us interested in quantum junta testers in the first 
place, and for initial discussions; Eric Blais for some discussions about monotonicity testing, and in 
particular for his suggestion to study the classical complexity of the GGT problem; Mark Zhandry 
for answering a question about [55]; Rocco Servedio for sending us a copy of 15U; Alexander 
Russell for helpful discussions about the quantum Fourier transform over the symmetric group; 
Aram Harrow for answering a question about the Schur-Weyl transform; and Jeroen Zuiddam for 
some helpful comments. We thank anonymous referees for many helpful suggestions and bringing 
our attention to references [3 [33]. 


References 

[1] S. Aaronson and Y. Shi. Quantum lower bounds for the collision and the element distinctness 
problems. Journal of the ACM, 51(4):595—605, 2004. 

[2] A. Ambainis. Quantum lower bounds by quantum arguments. Journal of Computer and 
System Sciences, 64(4):750-767, 2002. Earlier: STOC’OO, arXiv:quant-ph/0002066. 

[3] A. Ambainis, K. Balodis, A. Belovs, T. Lee, M. Santha, and J. Smotrovs. Separations in query 
complexity based on pointer functions. arXiv: 1506.04719, 2015. 

[4] A. Atici and R. A. Servedio. Quantum algorithms for learning and testing juntas. Quantum 
Information Processing, 6(5):323—348, 2007. arXiv:0707.3479. 

[5] D. Bacon, I. Chuang, and A. Harrow. Efficient quantum circuits for Schur and Clebsch-Gordan 
transforms. Physical Review Letters, 97(17):170502, 2006. arXiv:quant-ph/0407082. 

[6] D. Bacon, I. Chuang, and A. Harrow. The quantum Schur and Clebsch-Gordan trans¬ 
forms: I. efficient qudit circuits. In Proc. of 18th ACM-SIAM SODA, pages 1235-1244, 2007. 
arXiv:quant-ph/0601001. 

[7] R. Beals. Quantum computation of Fourier transforms over symmetric groups. In Proc. of 
29th ACM STOC, pages 48-53, 1997. 

[8] A. Belovs. Span programs for functions with constant-sized 1-certificates. In Proc. of ffth 
ACM STOC, pages 77-84, 2012. arXiv: 1105.4024, 

[9] A. Belovs. Applications of the Adversary Method in Quantum Query Algorithms. PhD thesis, 
University of Latvia, 2013. arXiv: 1402.3858, 


37 



[10] A. Belovs. Quantum algorithms for learning symmetric juntas via the adversary bound. Com¬ 
putational Complexity , 24(2):255-293, 2015. Earlier: CCC'14 , arXiv: 1311.6777. 

[11] A. Belovs. Variations on quantum adversaries. arXiv: 1504.06943, 2015. 

[12] A. Belovs and E. Blais. Quantum algorithm for monotonicity testing on the hypercube. 
arXiv:1503.02868, 2015. 

[13] A. Belovs and B. W. Reichardt. Span programs and quantum algorithms for st-connectivity 
and claw detection. In Proc. of 20th ESA , volume 7501 of LNCS, pages 193-204, 2012. 
arXiv:1203.2603. 

[14] A. Belovs and A. Rosmanis. Adversary lower bounds for the collision and the set equality 
problems. arXiv: 1310.5185, 2013. 

[15] S. Ben-David. A super-Grover separation between randomized and quantum query complexi¬ 
ties. arXiv:1506.081106, 2015. 

[16] E. Blais. Testing juntas nearly optimally. In Proc. of 41st ACM STOC, pages 151-158, 2009. 

[17] E. Blais. Testing juntas: a brief survey. In Goldreich [3l], pages 32-40. 

[18] E. Blais, J. Brody, and K. Matulef. Property testing lower bounds via communication com¬ 
plexity. Computational Complexity, 21(2):311 358, 2012. Earlier: CCC’ll, ECCC: 2011/045. 

[19] M. Boyer, G. Brassard, P. Hpyer, and A. Tapp. Tight bounds on quantum searching. 
Fortschritte der Physik, 46(4-5):493-505, 1998. arXiv:quant-ph/9605034. 

[20] G. Brassard, P. Hpyer, M. Mosca, and A. Tapp. Quantum amplitude amplification and esti¬ 
mation. In Quantum Computation and Quantum Information: A Millennium Volume , volume 
305 of AMS Contemporary Mathematics Series , pages 53-74, 2002. arXiv: quant-ph/0005055. 

[21] G. Brassard, P. Hpyer, and A. Tapp. Quantum cryptanalysis of hash and claw-free func¬ 
tions. In Proc. of 3rd LATIN, volume 1380 of LNCS, pages 163-169. Springer, 1998. 
arXiv:quant-ph/9705002. 

[22] H. Buhrman and R. de Wolf. Complexity measures and decision tree complexity: a survey. 
Theoretical Computer Science, 288:21-43, 2002. 

[23] A. Chakrabarti and O. Regev. An optimal lower bound on the communication complexity 
of gap Hamming distance. SIAM Journal on Computing, 41(5): 1299-1317, 2012. Earlier: 
STOC’ll , arXiv: 1009.3460. 

[24] Y. Cheng. An efficient randomized group testing procedure to determine the number of defec¬ 
tives. Operations Research Letters, 39(5):352-354, 2011. 

[25] Y. Cheng and Y. Xu. An efficient FPRAS type group testing procedure to approximate the 
number of defectives. Journal of Combinatorial Optimization, 27(2):302-314, 2014. 

[26] H. Chockler and D. Gutfreund. A lower bound for testing juntas. Information Processing 
Letters, 90(6):301-305, 2004. 


38 








[27] R. Cleve, A. Ekert, C. Macchiavello, and M. Mosca. Quantum algorithms revisited. Pro¬ 
ceedings of the Royal Society of London A: Mathematical, Physical and Engineering Sciences, 
454(1969):339-354, 1998. arXiv:quant-ph/9708016. 

[28] W. van Dam and I. E. Shparlinski. Classical and quantum algorithms for exponential 
congruences. In Proc. of 3rd TQC, volume 5106 of LNCS, pages 1-10. Springer, 2008. 
arXiv:0804.1109. 

[29] P. Damaschke and A. S. Muhammad. Bounds for nonadaptive group tests to estimate the 
amount of defectives. In Proc. of 4th COCOA, volume 6509 of LNCS, pages 117-130, Springer, 
2010. 

[30] R. Dorfman. The detection of defective members of large populations. The Annals of Mathe¬ 
matical Statistics, 14(4):436-440, 1943. 

[31] D. Z. Du and F. Hwang. Combinatorial group testing and its applications, volume 3 of Series 
on Applied Mathematics. World Scientific, 1993. 

[32] E. Fischer, G. Kindler, D. Ron, S. Safra, and A. Samorodnitsky. Testing juntas. Journal of 
Computer and System Sciences, 68(4):753—787, 2004. Earlier: FOCS’02. 

[33] D. Garcfa-Soriano. Query-Efficient Computation in Property Testing and Learning Theory. 
PhD thesis, CWI and University of Amsterdam, 2012. 

[34] O. Goldreich, editor. Property Testing: Current Research and Surveys, volume 6390 of LNCS. 
Springer, 2010. 

[35] L. K. Grover. A fast quantum mechanical algorithm for database search. In Proc. of 28th 
ACM STOC, pages 212-219, 1996. arXiv:quant-ph/9605043. 

[36] P. Hpyer, T. Lee, and R. Spalek. Negative weights make adversaries stronger. In Proc. of 39th 
ACM STOC, pages 526-535, 2007. arXiv:quant-ph/0611054. 

[37] K. Iwama, H. Nishimura, R. Raymond, and J. Teruyama. Quantum counterfeit coin problems. 
Theoretical Computer Science, 456:51-64, 2012. Earlier: ISAAC'10, arXiv: 1009.0416. 

[38] R. Kaas and J. Buhrman. Mean, median and mode in binomial distributions. Statistica 
Neerlandica, 34(1): 13-18, 1980. 

[39] Y. Kawano and H. Sekigawa. Quantum Fourier transform over symmetric groups. In Proc. of 
38th ISAAC, pages 227-234, 2013. 

[40] Y. Kawano and H. Sekigawa. Quantum Fourier transform over symmetric groups: Improved 
result. ACM Communications in Computer Algebra, 48(3):127-129, 2014. 

[41] A. Kitaev. Quantum measurements and the Abelian stabilizer problem. 

arXiv:quant-ph/9511026, 1995. 

[42] E. Kushilevitz and N. Nisan. Communication Complexity. Cambridge University Press, 1997. 

[43] T. Lee, R. Mittal, B. W. Reichardt, R. Spalek, and M. Szegedy. Quantum query complexity 
of state conversion. In Proc. of 52nd IEEE FOCS, pages 344-353, 2011. arXiv: 1011.3020, 


39 







[44] A. Montanaro and R. de Wolf. A survey of quantum property testing. Theory of Computing, 
2015. arXiv:1310.2035. To appear. 

[45] C. Moore, D. Rockmore, and A. Russell. Generic quantum Fourier transforms. ACM Trans. 
Algorithms, 2(4):707—T23, 2006. Earlier: SODA’Of, arXiv:quant-ph/0304064. 

[46] S. Raskhodnikova, D. Ron, A. Shpilka, and A. Smith. Strong lower bounds for approximating 
distribution support size and the distinct elements problem. SIAM Journal on Computing, 
39(3):813-842, 2009. Earlier: FOCS’07. 

[47] B. W. Reichardt. Span programs and quantum query complexity: The general adversary 
bound is nearly tight for every boolean function. In Proc. of 50th IEEE FOCS, pages 544-551, 
2009. arXiv:0904.2759. 

[48] B. W. Reichardt. Reflections for quantum query algorithms. In Proc. of 22nd ACM-SIAM 
SODA, pages 560-569, 2011. arXiv: 1005.1601. 

[49] B. W. Reichardt and R. Spalek. Span-program-based quantum algorithm for evaluating for¬ 
mulas. Theory of Computing, 8:291-319, 2012. Earlier: STOC’08, arXiv:0710.2630, 

[50] B. E. Sagan. The symmetric group: representations, combinatorial algorithms, and symmetric 
functions, volume 203 of Graduate Texts in Mathematics. Springer, 2001. 

[51] R. A. Servedio, L.-Y. Tang, and J. Wright. Adaptivity helps for testing juntas. In Proc. of 
30th IEEE CCC, volume 33 of LIPIcs, pages 264-279, 2015. 

[52] A. A. Sherstov. The communication complexity of gap Hamming distance. Theory of Com¬ 
puting, 8(8): 197-208, 2012. ECCC:2011/063, 

[53] E. V. Slud. Distribution inequalities for the binomial law. The Annals of Probability, 5(3):404- 
412, 1977. 

[54] M. Zhandry. How to construct quantum random functions. In Proc. of 53rd IEEE FOCS, 
pages 679-687, 2012. ePrint:2012/182, 

[55] M. Zhandry. A note on the quantum collision and set equality problems. Quantum Information 
& Computation, 15(7&;8):557-567, 2015. arXiv: 1312.1027, 


A Proof of Proposition |2.11| (b) 


The proof of this result is the same as the proof of Theorem 2.6 
of semi-definite matrices. 


We use the following easy properties 


Proposition A.l. Let A and B be positive semi-definite matrices of the same size. Then, the 
following matrices are also positive semi-definite: 

(a) obtained from A by simultaneously duplicating one of its rows and the same column; 

(b) the Hadamard (entry-wise) product Ao B. 
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First, trivially, 


/I 

0 


1\ 

0 


0 

Vi 


>- 0 . 


0 0 
1 1 
1 1 0 
0 0 1 / 

For j E [n], use Proposition A.l[ a) to duplicate rows and columns of this matrix, so that its rows 
correspond to the following four groups of inputs: z E F _ 1 (0), Zj = 0; z E -F _ 1 (0), Zj = 1; 
z E F _ 1 (l), Zj = 0; and z E i ? ~ 1 (l), Zj = 1. We can take the Hadamard product of this matrix 
and Xj, so we may assume without loss of generality that, for all x,y satisfying F(x) 7 ^ F(y), 


instead of (3b) we have the following stronger condition: 


X j{x,yj = 0 if Xj = yj 


and 


Y Xj[x,y\ = 1 . 

je [ra] 


(32) 


Let be an optimal solution to (3j) for Gj. Then, we define a feasible solution Xji to (3) for 
the composed function F o (G i,..., G n ) in the following way. 

Let x = (xn,..., xi mi , ... ,x n i,... ,x nmri ) be an input in the domain of F o (G i,... ,G n ). 
Thus, there exists z in the domain of / such that Zj = Gj (xj \,..., Xj mj ) for all relevant j. Fix this 
choice of 2 to x. Assume that y = (y\ i,..., yi mi , • • • , y n i, ..., y n m n ) and t are defined analogously 
to x and z, respectively. Then, define 

2 /] = X j\ z i • • • i x jmj)i {lljli ■ ■ • > Z/jmj)]) 

where we extend Xy with zeroes for all inputs outside the domain of Gj. 




First, Xji F 0 due to Proposition [AT Next, if F(z ) 7 ^ F(t), then 


Y X nl x ^y} = Y Xjfc’tl Y x i J) K x FT--’ x j™-i)’(yF’---iyjm j )\ = Y x ^ z ’ t ^ = 1 - 

ji: Xji^Uji ie[n] i- je[n] 


Here we used that if Xj [z, f] 7 ^ 0, then j is relevant for both z and t, and Gj(xji, ... ,Xj mj ) 7 ^ 
Gj(yji,... ,yjm,j ) because of ([32]). Finally, we have 

Y x jil x , x l = Y Xjl z i ■ i x jrnj), < Q max AD X ± (Gj). 

ji Mn] i je[n] 
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